STIC-Biotech/ChemUb 



From: 

Sent: 

To: 

Subject: 



F^ussgI JstfrGy 

Wednesday. December 14, 2005 9:09 AM 
STIC-Biotech/ChemLib 

Database Search Request, Serial Number: 10/789,494 



Requester: 

Jeffrey Russel (TC1600) 
Art Unit: 

1654 

Employee Number: 

62785 
Office Location: 

REM 3D19 
Phone_Number : 

571-272-0969 
Mailbox Number: 

REM 3C18 



o 



12 



Case serial number: 

10/789,494 
Class / Subclass (es) : 

NA 

Earliest Priority Filing Date: 
NA 

Format preferred for results: 

Diskette 
Search Topic Information: 

Please search SEQ ID NOS : 3 , 1 , and 8 in the U.S. patent application 
sequence databases (pending, published, and issued) and in 
Geneseq/Uniprot/Pir . Thank you. 
Special Instructions and Other Comments: 



4c t * 4c * :tc * 9k « * * 3|c * ♦ * 4c * * * * * 

Searcher: 

Searcher Phone: 

Date Searcher Picked up:_ 

Date completed: 

Searcher Prep Time: 

Online Tinne: 



******* 4e 3tc 9|c 9|e * )tc * 

Type of Search 

NA# AA#: 

S/L: Oligomer: 

Encode/TransI : 

Structure #: ^Text: 

Inventor: Litigation: 



******************** 

Vendors and cost where applicable 

STN: 

DIAL0G:_ 

QUESTEL/ORBn": 

LEXIS/N EXIS : 

SEQUENCE SYSTEM: 

WWW/Internet: 

Other (Spedfy): 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd, 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Secjuence: 

Scoring table: 



December 16, 2005, 23:08:17 ; Search time 143.75 Seconds 

(without alignments) 
70.301 Million cell updates/sec 

US-10-789-494B-3 
109 

1 AASSVSSASSRSYDYSRRNVRKN 23 



BLOSUM62 
Gapop 10.0 



Gapext 0 . 5 

Searched: 2443163 seqs, 439378781 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : 



2443163 



A_Geneseq_2 1 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3 : geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6 : geneseqp2003as : * 

7 : geneseqp2 003bs : * 

8: geneseqp2004s : * 

9: geneseqp2005s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



) . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


109 


100.0 


23 


8 


ADU51229 


Adu51229 Domestic 


2 


109 


100.0 


23 


8 


ADU51207 


Adu51207 Silkworm 


3 


109 


100.0 


43 


8 


ADU51169 


Adu51169 Domestic 


4 


73 


67.0 


654 


9 


ADZ09405 


Adz09405 Canine pa 


5 


69 


63.3 


33 


9 


AEB30830 


Aeb30830 Spider th 


6 


50 


45.9 


271 


3 


AAY57719 


Aay57719 Spinach r 


7 


50 


45.9 


663 


2 


AAW29150 


Aaw29150 Dual -spec 


8 


50 


45.9 


663 


5 


ABR52351 


Abr52351 Protein r 



9 


50 


45 


. 9 


663 


5 


ADI17164 


Aclil7164 


Murine NO 


10 


50 


45 


. 9 


1059 


4 


ABB71639 


Abb71639 


Drosophil 


11 . 


50 


45 


.9 


1059 


8 


ADS96722 


Ads96722 


Drosophil 


12 


49 


45 


.0 


383 


2 


AAR51059 


Aar51059 


Sequence 


13 


48 


44 


.0 


472 


8 


ADL27854 


Adl27854 


A vitis i 


14 


47 . 5 


43 


.6 


674 


6 


ABR53678 


Abr53678 


Protein s 


15 


47.5 


43, 


.6 


674 


7 


ADK64 078 


Adk64078 


Disease t 


16 


46 


42 


.2 


117 


4 


AAU32566 


Aau32566 


Novel hum 


17 


46 


42 


.2 


277 


3 


AAG20058 


Aag2 0058 


Arabidops 


18 


46 


42 , 


.2 


278 


3 


AAG20057 


Aag20057 


Arabidops 


19 


46 


42 , 


.2 


359 


3 


AAG20056 


Aag20056 


Arabidops 


20 


45 


41 


.3 


50 


4 


ABB22229 


Abb2222 9 


Protein # 


21 


45 


41 


.3 


50 


4 


AAM70055 


Aam70055 


Human bon 


22 


45 


41 , 


.3 


50 


4 


AAM57648 


Aam57648 


Human bra 


23 


45 


41, 


.3 


50 


4 


ABG51748 


Abg51748 


Human 1 i v 


24 


45 


41. 


.3 


50 


4 


AAM05532 


Aam05532 


Peptide # 


25 


45 


41, 


.3 


50 


5 


ABG3 9686 


Abg39686 


Human pep 


26 


45 


41, 


.3 


146 


7 


ADG74666 


Adg74666 


Human kin 


27 


45 


41, 


.3 


235 


7 


ADG74665 


Adg74665 


Human kin 


28 


45 


41. 


.3 


317 


4 


ABG24218 


Abg24218 


Novel hum 


29 


45 


41. 


.3 


388 


7 


ADC15643 


Adcl5643 


E. intest 


30 


45 


41 , 


.3 


455 


8 


ABM83 064 


Abm83064 


Human dia 


31 


45 


41. 


.3 


455 


8 


ABM83 061 


At)m83061 


Human dia 


32 


45 


41. 


.3 


486 


7 


ADE58782 


Ade58782 


Human Pro 


33 


45 


41, 


.3 


486 


8 


ADI82554 


Adi82554 


Human mod 


34 


45 


41, 


.3 


486 


8 


ADR97409 


Adr97409 


Human PPP 


35 


45 


41. 


.3 


486 


9 


ADX08183 


Adx08183 


Cycl in -de 


36 


45 


41. 


.3 


487 


8 


ABM83 063 


Abm83063 


Human dia 


37 


45 


41. 


.3 


487 


8 


ABM83062 


Abm83062 


Human dia 


38 


45 


41, 


.3 


892 


7 


ADD18910 


Addl8910 


Human dis 


39 


45 


41. 


.3 


916 


7 


ADB64143 


Adb64143 


Human pro 


40 


45 


41. 


.3 


1002 


7 


ADC15645 


Adcl5645 


E. intest 


41 


45 


41 . 


.3 


1298 


7 


ADF60378 


Adf60378 


Human con 


42 


45 


41. 


.3 


1807 


8 


ADO17160 


Adol7160 


Novel exp 


43 


44.5 


40. 


.8 


1677 


8 


AD017159 


Adol7159 


Novel exp 


44 


44 


40. 


.4 


91 


5 


ADK34170 


Adk34170 


Novel hum 


45 


44 


40. 


,4 


215 


8 


ADH75876 


Adh75876 


Lucerne C 



ALIGNMENTS 



RESULT 1 

ADU5122 9 

ID ADU51229 standard; peptide; 23 AA. 
XX 

AC ADU51229; 
XX 

DT 24-FEB-2005 (first entry) 
XX 

DE Domestic silkworm fibroin H-chain peptide fragment 14. 

XX 

KW vulnerary; cell proliferation; wound healing; cell adhesion; cosmetics; 

KW cell culture; fibroin. 

XX 

OS Bombyx mori . 
XX 



PN JP2004339189-A. 

XX 

PD 02-DEC-2004. 
XX 

PF 04-DEC-2003; 2003 JP-00406608 . 

XX 

PR 28-FEB-2003; 2003 JP-00055048 . 
XX 

PA (DOKU-) DOKURITSU GYOSEI HOJIN NCX3Y0 SEIBUTSU SH. 

PA (TSUB/) TSUBOUCHI K. 
XX 

DR WPI; 2004-827614/82. 
XX 

PT New peptide having excellent cell growth promoting activity, for use as a 

PT cell growth promoter, cell adhesion agent, wound healing-promoting agent, 

PT cosmetic and cell culture base material. 
XX 

PS Example 3; Page; 27pp; Japanese. 
XX 

CC The invention relates to a novel peptide having excellent cell growth 

CC promoting activity. The peptide of the invention demonstrates vulnerary 

CC activity and may be utilised as a cell growth promoter, cell adhesion 

CC agent, wound healing-promoting agent or cosmetic and cell culture base 

CC material. The current sequence is that of a domestic silkworm fibroin H- 

CC chain peptide fragment of the invention which is described as being 

CC amorphous . 
XX 

SQ Sequence 23 AA; 

Query Match 100.0%; Score 109; DB 8; Length 23; 

Best Local Similarity 100.0%; Pred. No. 3.1e-10; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 AASSVSSASSRSYDYSRRNVRKN 23 

Illllllllllllllllllllll 
Db 1 AASSVSSASSRSYDYSRRNVRKN 23 



RESULT 4 
ADZ09405 

ID ADZ09405 standard; protein; 654 AA. 
XX 

AC ADZ09405; 
XX 

DT 30-JUN-2005 (first entry) 
XX 

DE Canine parvovirus VP2 protein Seq 8 . 
XX 

KW vaccine; antigen; VP2; silk; immune stimulation; antibody production; 
KW virucide; antiparasitic; cytostatic; immunostimulant . 

XX 

OS Canine parvovirus. 
XX 

PN JP2005097229-A. 

XX 

PD 14-APR-2005. 



XX 

PF 03-DEC-2003; 2003 JP- 00405124 . 
XX 

PR 04-DEC-2002; 2002JP-00352084 . 

PR 05-SEP-2003; 2003JP-00314808 . 

XX 

PA (TORA ) TORAY IND INC. 
XX 

DR WPI; 2005-299766/31. 

DR N-PSDB; ADZ09400. 

XX 

PT Vaccine for preventing infection caused by virus, parasite, and tumor, 

PT comprises antigenic protein and component derived from silk thread. 
XX 

PS Disclosure; SEQ ID NO 12; 21pp; Japanese. 
XX 

CC This invention relates to a novel vaccine. Specifically, it refers to a 

CC vaccine that comprises an antigenic protein, in particular the canine- 

CC parvovirus VP2 protein and a component derived from silk thread. The 

CC present invention describes a recombinant silkworm engineered to produce 

CC this antigenic protein in a silk gland or silk thread, whereby the gene 

CC encoding the antigenic protein has been transduced into a chromosome 

CC using a vector to transfer this foreign gene. Administration of the 

CC vaccine activates the immune response and so induces production of 

CC antibodies against the antigenic canine-parvovirus VP2 protein in the 

CC blood of an animal. Note that the silk thread containing this antigenic 

CC protein is pulverized to form particles or is solubilized, hence the 

CC vaccine can be obtained with high purity in a cost effective manner. 

CC Accordingly, compositions derived thereof exhibit virucide, 

CC antiparasitic, cytostatic and immunostimulant activities. This 

CC polypeptide sequence is the canine parvovirus VP2 protein of the 

CC invention. NOTE: This protein is given as an embedded protein the 

CC sequence listing and is not further referred to in the specification. 

XX 

SQ Sequence 654 AA; 

Query Match 67.0%; Score 73; DB 9; Length 654; 

Best Local Similarity 100.0%; Pred. No. 0.0097; 

Matches 14; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 10 SRSYDYSRRNVRKN 23 

Illillllllllll 
Db 621 SRSYDYSRRNVRKN 634 



RESULT 5 
AEB30830 

ID AEB30830 standard; peptide; 33 AA. 
XX 

AC AEB3 0830; 
XX 

DT 06-OCT-2005 (first entry) 
XX 

DE Spider thread peptide #1. 
XX 

KW Silk; spider thread protein. 
XX 



OS Bombyx mori . 
XX 

PN W02005068495-A1. 
XX 

PD 28-JUL-2005. 

XX 

PF 12-JAN-2005; 2005WO- JP000619 . 
XX 

PR 13-JAN-2004; 2004 JP-00005489 . 

XX 

PA (TORA ) TORAY IND INC. 

PA (DUPO ) DU PONT DE NEMOURS & CO E I . 

XX 

PI Hiramatsu S, Moriyama H, Asaoka R, Morita K, Tanaka T, Yatnada K; 

PI Obrien JP, Fahnestock SR; 

XX 

DR WPI; 2005-522809/53. 

XX 

PT Silk thread useful for producing textile fabric and in aeronautical 

PT navigation, space exploration, has spider thread protein, produced by 

PT transducing gene encoding spider thread protein to silkworm having 

PT fibroin H- chain gene. 
XX 

PS Claim 16; SEQ ID NO 3; 48pp; Japanese. 
XX 

CC The invention relates to a silk thread comprising a spider thread 

CC protein, produced by a transducing gene encoding spider thread protein in 

CC a silkworm having a fibroin H-chain gene, without damaging the silkworm 

CC fibroin H-chain gene. The invention also relates to producing silk thread 

CC involving producing a transgenic silkworm and extracting silk thread from 

CC the transgenic silkworm. The silk thread is useful for producing a 

CC textile fabric and also useful in aeronautical navigation, space 

CC exploration, to produce clothing, towrope and medical thread, etc. The 

CC silk thread has high strength and elongation property. This sequence 

CC represents a spider thread peptide of the invention. 

XX 

SQ Sequence 33 AA; 

Query Match 63.3%; Score 69; DB 9; Length 33; 

Best Local Similarity 100.0%; Fred. No. 0.0013; 

Matches 13; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 11 RSYDYSRRNVRKN 23 

IMIIIIIIIIII 

Db 1 RSYDYSRRNVRKN 13 



Search completed: December 16, 2005, 23:46:08 
Job time : 146.75 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Riin on: 



December 16, 2005, 23:40:43 ; Search time 34.9107 Seconds 

(without alignments) 
54.469 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-789-494B-3 
109 

1 AASSVSSASSRSYDYSRRNVRKN 23 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



572060 seqs, 82675679 residues 



Total number of hits satisfying chosen parameters: 572060 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_AA: * 

1: /cgn2_6/ptodata/l/iaa/5_COMB.pep: * 

2 : /cgn2_6/ptodata/l/iaa/6_COMB.pep: * 

3 : /cgn2_6/ptodata/l/iaa/H_COMB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/PCTUS_COMB.pep: * 

5 : /cgn2_6/ptodata/l/iaa/RE_COMB .pep : * 

6 : /cgn2_6/ptodata/l/iaa/backf ilesl .pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than pr equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


50 


45 


.9 


434 


2 


US- 


09 


-270 


-767-61286 


Sequence 


61286, A 


2 


50 


45 


.9 


1059 


2 


US- 


09 


-270 


-767-45764 


Sequence 


45764, A 


3 


46.5 


42 


7 


93 


2 


us- 


09 


-270 


-767-60930 


Sequence 


60930, A 


4 


46.5 


42 


7 


427 


2 


us- 


09 


-270 


-767-45426 


Sequence 


45426, A 


5 


45 


41 


3 


908 


2 


us- 


09 


-949 


-016-7580 


Sequence 


7580, Ap 


6 


45 


41 


3 


916 


2 


us- 


10 


-104 


-047-2297 


Sequence 


2297, Ap 


7 


44 


40 


.4 


490 


2 


us- 


09 


-976 


-594-314 


Sequence 


314, App 


8 


44 


40 


4 


490 


2 


us- 


09 


-905 


-999-23 


Sequence 


23, Appl 


9 


44 


40 


4 


508 


2 


us- 


09 


-949 


-016-8519 


Sequence 


8519, Ap 


10 


44 


40 


4 


508 


2 


us- 


09 


-949 


-016-8520 


Sequence 


8520, Ap 


11 


44 


40 


4 


638 


2 


us- 


10 


-104 


-047-2626 


Sequence 


2626, Ap 


12 


43.5 


39 


9 


235 


2 


us- 


09 


-248 


-796A-24586 


Sequence 


24586, A 


13 


43 


39 


4 


172 


2 


us- 


09 


-252 


-991A-20172 


Sequence 


20172, A 


14 


43 


39 


4 


223 


2 


us- 


09 


-270 


-767-38042 


Sequence 


38042, A 


15 


43 


39 


4 


223 


2 


us- 


09 


-270 


-767-53259 


Sequence 


53259, A 


16 


42 


38 


5 


141 


2 


us- 


09 


-252 


-991A-26722 


Sequence 


26722, A 


17 


42 


38. 


5 


168 


2 


us- 


09 


-107 


-532A-3748 


Sequence 


3748, Ap 


18 


42 


38. 


5 


272 


2 


us- 


09 


-107 


-532A-6938 


Sequence 


6938, Ap 



19 


42 


38 


.5 


353 


1 


US- 


-08 


-687 


-702-37 


Sequence 


37, Appl 


20 


42 


38 


.5 


412 


2 


us- 


-09 


-027 


-064-4 


Sequence 


4, Appli 


21 


42 


38 


.5 


412 


2 


us- 


-09 


-271 


-815-4 


Sequence 


4, Appli 


22 


42 


38 


.5 


605 


2 


us- 


-09 


-538 


-092-375 


Sequence 


375, App 


23 


42 


38 


.5 


2090 


2 


us- 


-09 


-538 


-092-1081 


Sequence 


1081, Ap 


24 


42 


38 


.5 


2120 


2 


us- 


-09 


-949 


-016-9768 


Sequence 


9768, Ap 


25 


41.5 


38 


.1 


185 


2 


us- 


-09 


-248 


-796A-27129 


Sequence 


27129, A 


26 


41.5 


38 


.1 


246 


2 


us- 


■09 


-104 


-678A-2 


Sequence 


2, Appli 


27 


41 . 5 


38 


. 1 


247 


2 


us- 


09 


-548 


-938A-12 


Sequence 


12, Appl 


28 


41 


37 


.6 


224 


1 


us- 


■08 


-248 


-466B-8 


Sequence 


8 , Appl i 


29 


41 


37 


6 


259 


2 


us- 


-09 


-270 


-767-45089 


Sequence 


45089, A 


30 


41 


37 


6 


285 


2 


us- 


•08 


-482 


-085B-20 


Sequence 


20, Appl 


31 


41 


37 


6 


365 


2 


us- 


09 


-270 


-767-44316 


Sequence 


44316, A 


32 


41 


37 


6 


380 


2 


us- 


09 


-270 


-767-43626 


Sequence 


43626, A 


33 


41 


37 


6 


394 


2 


us- 


09 


-769 


-787-41 


Sequence 


41, Appl 


34 


41 


37 


6 


528 


2 


us- 


09 


-270 


-767-61237 


Sequence 


61237, A 


35 


41 


37 


6 


530 


2 


us- 


09 


-949 


-016-9519 


Sequence 


9519, Ap 


36 


41 


37 


6 


648 


1 


us- 


08 


-817 


-436A-2 


Sequence 


2, Appli 


37 


41 


37 


6 


670 


2 


us- 


09 


-270 


-767-45719 


Sequence 


45719, A 


38 


41 


37 


6 


1032 


2 


us- 


09 


-489 


-039A-14025 


Sequence 


14025, A 


39 


41 


37 


6 


1046 


2 


us- 


09' 


-538 


-092-445 


Sequence 


445, App 


40 


40.5 


37, 


2 


100 


2 


us- 


09- 


-513 


-999C-6133 


Sequence 


6133, Ap 


41 


40.5 


37, 


2 


238 


2 


us- 


09- 


-632 


-570-15 


Sequence 


15, Appl 


42 


40.5 


37. 


2 


238 


2 


us- 


09- 


-632 


-575-45 


Sequence 


45, Appl 


43 


40 . 5 


37. 


2 


239 


2 


us- 


09- 


-216- 


-295-15 


Sequence 


15, Appl 


44 


40 


36. 


7 


119 


2 


us- 


08- 


-890- 


-865A-17 


Sequence 


17, Appl 


45 


40 


36. 


7 


148 


2 


us- 


09- 


-252- 


-991A-28329 


Sequence 


28329, A 



ALIGNMENTS 



RESULT 1 

US-09-270-767-61286 

; Sequence 61286, Application US/09270767 

; Patent No. 6703491 

; GENERAL INFORMATION: 

; APPLICANT: Hottiburger et al . 

; TITLE OF INVENTION: Nucleic acids and proteins of Drosophila melanogaster 

; FILE REFERENCE: File Reference: 7326-094 

; CURRENT APPLICATION NUMBER: US/09/270,767 

; CURRENT FILING DATE: 1999-03-17 

; NUMBER OF SEQ ID NOS : 62517 

; SOFTWARE: Patentin Ver. 2.0 

; SEQ ID NO 61286 

LENGTH: 434 

TYPE : PRT 

ORGANISM: Drosophila melanogaster 
US-09-270-767-61286 

Query Match 45.9%; Score 50; DB 2; Length 434; 

Best Local Similarity 45.5%; Pred. No. 8.7; 

Matches 10; Conservative 6; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 AASSVSSASSRSYDYSRRNVRK 22 

:|:| II I III hh 

Db 360 SANSKSSGGNKMSDVSRRNIRE 381 



Search completed: December 17, 2005, 00:12:07 
Job time : 35.9107 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



December 16, 2005, 23:52:28 ; Search time 121.571 Seconds 

(without alignments) 
79.049 Million cell updates/sec 

US-10-789-494B-3 
109 

1 AASSVSSASSRSYDYSRRNVRKN 23 



BLOSUM62 
Gapop 10.0 



Gapext 0 . 5 



Searched: 1867569 seqs, 417829326 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



1867569 



Pos t -proces s ing : 



Database 



Fred. 



Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Published_Applications AA Main:* 



/ cgn2_6 /p t oda t a / 1 /pubpaa / US 0 7_PUBC0MB . pep : * 
/ cgn2_6 /p t oda t a / 1 /pubpaa /US 0 8_PUBC0MB . pep : * 
/ cgn2_6 /p t oda t a / 1 /pubpaa /US 0 9_PUBC0MB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep: * 
/cgn2_6/ptodata/l/pubpaa/USll_PUBC0MB.pep: * 



No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



Query 

Match Length DB 



ID 



Description 



1 


109 


100 


0 


23 


5 


US-10-789-494B-3 


2 


109 


100 


0 


23 


5 


US-10-789-494B-59 


3 


109 


100 


0 


43 


5 


US-10-789-494B-21 


4 


50 


45 


9 


663 


4 


US-10-072-012-700 


5 


50 


45 


9 


663 


5 


US-10-029-345A-40 



Sequence 3, Appli 
Sequence 59, Appl 
Sequence 21, Appl 
Sequence 700, App 
Sequence 40, Appl 



6 


50 


45 


.9 


1059 


6 


US- 


11 
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158754, 


8 


48 


44 


. 0 


1507 


4 


US- 


10 


-425 


-115 


-342056 
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9 


47 


43 
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13 


45 


41 


.3 


208 


4 


us- 


10 


-437 


-963 


-184293 
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Sequence 


54577, A 
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45 
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.3 
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Sequence 


177, App 
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Sequence 


184282, 
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45 
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Sequence 


2297, Ap 
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Sequence 
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20 


44.5 


40 


8 
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4 
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10 
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Sequence 
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21 


44 


40 


4 
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10 


-767 


-701 
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Sequence 


43958, A 


22 


44 


40 


4 
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5 
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10 


-732 


-923 
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Sequence 
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23 


44 


40 


4 
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4 
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10 


-421 


-138A-316 


Sequence 


316, App 


24 


44 


40 


4 
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3 
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09 


-864 


-761 
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Sequence 


46689, A 


25 


44 


40 


4 
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10 


-767 


-701 


-43959 


Sequence 
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26 


44 


40 


4 
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4 
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10 


-437 


-963 
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Sequence 


151972, 


27 


44 


40 


4 
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4 
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10 


-282 


-122A-65292 


Sequence 


65292, A 


28 


44 


40 


4 


490 


3 
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09 


-905 


-999 


-23 


Sequence 


23, Appl 


29 


44 


40 


4 


490 


4 
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10 


-267 


-502 


-354 


Sequence 


354, App 


30 


44 


40 


4 


490 


5 
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10 


-825 


-177 


-23 


Sequence 


23, Appl 


31 


44 


40 


4 


490 


6 
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11 


-029 


-470 


-10 


Sequence 


10, Appl 


32 


44 


40 


4 


534 


6 
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11 


-097 


-143 


-17595 


Sequence 


17595, A 


33 


44 


40 


4 


638 


4 
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10 


-104 


-047 


-2626 


Sequence 


2626, Ap 


34 


44 


40 


4 
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4 
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10 


-437 


-963 
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Sequence 


160584, 


35 


43.5 
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9 


92 
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10 
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-115 


-349762 


Sequence 


349762, 


36 


43.5 


39 


9 
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4 
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10 


-032 


-585 
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Sequence 


7789, Ap 


37 


43 


39 


4 
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4 
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10 
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-115 


-242444 


Sequence 


242444, 


38 


43 


39 


4 
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4 
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10 


-424 


-599 
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Sequence 


214039, 


39 


43 


39 


4 
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4 
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10 


-767 


-701 
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Sequence 


39778, A 


40 


43 


39 


4 
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4 
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10 


-424 
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-254434 


Sequence 


254434, 


41 


43 


39. 


4 
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5 
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10 


-739 


-930 


-8688 


Sequence 


8688, Ap 


42 


43 


39. 


4 
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5 
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10 


-732 


-923 


-16338 


Sequence 


16338, A 


43 


43 


39. 


4 
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4 
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10 


-425 


-114 


-42487 


Sequence 


42487, A 


44 


43 


39. 


4 


223 


5 


us- 


10 


-732 


-923 


-16168 


Sequence 


16168, A 


45 


43 


39. 


4 


223 


5 


us- 


10 


-732 


-923 


-16337 


Sequence 


16337, A 



ALIGNMENTS 



RESULT 1 

US-10-789-494B-3 

; Sequence 3, Application US/10789494B 

; Publication No. US20050143296A1 

; GENERAL INFORMATION: 

; APPLICANT: TSUBOUCHI , Kozo 

; APPLICANT: YAMADA, Hiromi 

; TITLE OF INVENTION: EXTRACTION AND UTILIZATION OF CELL 

; TITLE OF INVENTION: GROWTH- PROMOTING PEPTIDES FROM SILK PROTEIN 

; FILE REFERENCE: OPS 635 

; CURRENT APPLICATION NUMBER: US/lO/789 , 494B 
; CURRENT FILING DATE: 2004-02-27 



; PRIOR APPLICATION NUMBER: JP 2003-55048 
; PRIOR FILING DATE: 2003-02-28 
; NUMBER OF SEQ ID NOS : 85 
; SEQ ID NO 3 

LENGTH: 23 

TYPE: PRT 

ORGANISM: Bombyx mori 
US-10-789-494B-3 

Query Match 100.0%; Score 109; DB 5; Length 23; 

Best Local Similarity 100.0%; Pred. No. 3.3e-09; 
Matches 23; Conservative 0; Mismatches 0; Indels 

Qy 1 AASSVSSASSRSYDYSRRNVRKN 23 

llllllllllllllillllMII 
Db 1 AASSVSSASSRSYDYSRRNVRKN 23 



RESULT 4 

US-10-072-012-700 

Sequence 700, Application US/10072012 
Publication No. US20040033493A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Tchernev, Velizar 
Spytek, Kimberly 
Zerhusen, Bryan 
Patturajan, Meera 
Shimkets, Richard 
Li , Li 
Gangolli, 
Padigaru, 
Anderson, 
Rastelli, 



Esha 

Muralidhara 
David W. 
Luca 



Miller, Charles E. 
Gerlach, Valerie 
Taupier Jr, Raymond J 
Gusev, Vladimir Y. 
Colman, Steven D. 
Wolenc, Adam R. 
Pena, Carol E. A 
Furtak, Katarzyna 
Grosse, William M. 
Alsobrook II, John P. 
Lepley, Denis e M, 
Rieger, Daniel K. 
Burgess, Catherine E. 



APPLICANT: 

TITLE OF INVENTION: Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-258 

CURRENT APPLICATION NUMBER: US/lO/072 , 012 

CURRENT FILING DATE: 2002-01-31 

PRIOR APPLICATION NUMBER: 60/265,102 

PRIOR FILING DATE: 2001-01-30 

PRIOR APPLICATION NUMBER: 60/265,514 

PRIOR FILING DATE: 2001-01-31 

PRIOR APPLICATION NUMBER: 60/265,517 

PRIOR FILING DATE: 2001-01-31 



/ PRIOR APPLICATION NUMBER: 60/265,412 

/ PRIOR FILING DATE: 2001-01-31 

; PRIOR APPLICATION NUMBER: 60/265,395 

; PRIOR FILING DATE: 2001-01-31 

; PRIOR APPLICATION NUMBER: 60/266,406 

; PRIOR FILING DATE: 2001-02-02 

; PRIOR APPLICATION NUMBER: 60/266,767 

; PRIOR FILING DATE: 2001-02-05 

; PRIOR APPLICATION NUMBER: 60/267,057 

; PRIOR FILING DATE: 2001-02-07 

PRIOR APPLICATION NUMBER: 60/266,975 
; PRIOR FILING DATE: 2001-02-07 
; PRIOR APPLICATION NUMBER: 60/267,459 
; PRIOR FILING DATE: 2 001-02-08 

; Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS : 1391 
; SOFTWARE: Patentin Ver. 2.1 
; SEQ ID NO 700 

LENGTH: 663 

TYPE: PRT 

ORGANISM: Mus musculus 
US-10-072-012-700 



Query Match 45.9%; Score 50; DB 4; Length 663; 

Best Local Similarity 57.1%; Pred. No. 88; 

Matches 12; Conservative 4; Mismatches 5; Indels 0; Gaps 0; 

Qy 1 AASSVSSASSRSYDYSRRNVR 21 

::|| Ihll I I Ihll 
Db 588 SSSSSSSSSSSSSDLRRRDVR 608 



RESULT 11 

US- 10-437 -963 -169698 

Sequence 169698, Application US/10437963 
Publication No. US20040123343A1 
GENERAL INFORMATION: 
APPLICANT: La Rosa, Thomas J. 
APPLICANT: Kovalic, David K. 
APPLICANT: Zhou, Yihua 
APPLICANT: Cao, Yongwei 
APPLICANT: Wu, Wei 
APPLICANT: Boukharov, Andrey A. 
APPLICANT: Barbazuk, Brad 
APPLICANT: Li, Ping 

TITLE OF INVENTION: Rice Nucleic Acid Molecules and Other Molecules 
Associated With 

TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement 
FILE REFERENCE: 38 -21 (53221) B 
CURRENT APPLICATION NUMBER: US/lO/437 , 963 
CURRENT FILING DATE: 2003-05-14 
NUMBER OF SEQ ID NOS: 204 966 
SEQ ID NO 169698 
LENGTH: 459 
TYPE : PRT 



; ORGANISM: Oryza sativa 
FEATURE : 

OTHER INFORMATION: Clone ID: PAT_MRT453 0_68 093C.1 .pep . 
US-10-437-963-169698 



Query Match 42.2%; Score 46; DB 4; Length 459; 

Best Local Similarity 39.1%; Pred. No. 2.3e+02; 

Matches 9; Conservative 7; Mismatches 7; Indels 0; Gaps 0; 



Qy 1 AASSVSSASSRSYDYSRRNVRKN 23 

Mlhh-I h II 
Db 36 AASSLSASASLDADFDKKQFRHN 58 



Search completed: December 17, 2005, 00:17:32 
Job time : 122.571 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



December 16, 2005, 23:53:24 ; Search time 6.16071 Seconds 

(without alignments) 
25.151 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-789-494B-3 
109 

1 AASSVSSASSRSYDYSRRNVRKN 23 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



51470 seqs, 6736768 residues 



Total number of hits satisfying chosen parameters: 51470 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA_New: * 

1 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB .pep : * 

2 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB.pep: * 

3 : /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB .pep : * 

4 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB .pep : * 
5: /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep: * 
6: /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep: * 
7 : /cgn2_6/ptodata/l/p\ibpaa/USll_NEW_PUB .pep : * 
8 : /cgn2_6/ptodata/l/piibpaa/US60_NEW_PUB .pep : * 



Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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100, App 


36 


35 


32. 


1 


297 


6 


US-10-467-657-7296 
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23, Appl 
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8540, Ap 
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Sequence 


410, App 
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Sequence 
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US-10-467-657-8534 


Sequence 


8534, Ap 



ALIGNMENTS 



RESULT 1 

US-10-467-657-4550 

; Sequence 4550, Application US/10467657 

; Publication No. US20050260581A1 

; GENERAL INFORMATION: 

; APPLICANT: CHIRON SpA 

; APPLICANT: FONTANA Maria Rita 

; APPLICANT: PIZZA Mariagrazia 

; APPLICANT: MASIGNANI Vega 

; APPLICANT: MONACI Elisabetta 

; TITLE OF INVENTION: GONOCOCCAL PROTEINS AND NUCLEIC ACIDS 

; FILE REFERENCE: 

; CURRENT APPLICATION NUMBER: US/lO/467 , 657 

; CURRENT FILING DATE: 2003-08-11 

; PRIOR APPLICATION NUMBER: GB-0103424.8 

; PRIOR FILING DATE: 2001-02-12 

; NUMBER OF SEQ ID NOS : 9218 

; SOFTWARE: SeqWin99, version 1.04 
; SEQ ID NO 4550 

LENGTH: 436 

TYPE : PRT 
; ORGANISM: Neisseria gonorrhoeae 
US-10-467-657-4550 

Query Match 40.4%; Score 44; DB 6; Length 436; 

Best Local Similarity 39.1%; Pred. No. 6.8; 

Matches 9; Conservative 8; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 AASSVSSASSRSYDYSRRNVRKN 23 



Db 



332 AAAAKSNAGYKAYNQMRRFVKEN 3 




Search completed: December 17, 2005, 00:17:54 
Job time : 7.16071 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



December 16, 2005, 23:33:32 



; Search time 23.8214 Seconds 
(without alignments) 
92.899 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-789-494B-3 
109 

1 AASSVSSASSRSYDYSRRNVRKN 23 



Scoring table: BLOSUM62 

Gapop 10.0 



Gapext 0.5 



Searched : 



283416 seqs, 96216763 residues 



Total number of hits satisfying chosen parameters: 



283416 



Minimum DB seq length: 0 

Maximum DB seq length; 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : PIR_80:* 
1: pirl:* 
2: pir2:* 
3 : pir3 : * 
4: pir4:* 

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No. 


Score 


Match Length 


DB 


ID 


Description 
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19 


44 


40 


4 


490 


2 


S53639 


protein kinase elk 


20 


44 


40 


4 


490 


2 


S70352 


protein kinase elk 


21 


44 


40 


4 


754 


1 


P1IV50 


RNA-directed RNA p 


22 


44 


40 


4 


1513 


2 


S45768 


mitotic spindle pr 


23 


43 


39 


4 


342 


2 


C84234 


phosphate regulate 


24 


43 


39 


4 


369 


2 


C88030 


protein F46F5 . 10 [ 


25 


43 


39 


4 


382 


2 


T04260 


hypothetical prote 


26 


43 


39 


4 


405 


2 


C42094 


bw3 protein - smut 


27 


43 


39 


4 


602 


2 


S58336 


probable membrane 


28 


43 


39 


4 


695 


2 


T13648 


mitosis initiation 


29 


43 


39 


4 


1108 


2 


A96623 


hypothetical prote 


30 


43 


39 


4 


1383 


2 


C96723 


hypothetical prote 


31 


43 


39 


4 


1465 


2 


D86478 


protein F1504 . 11 [ 


32 


42 


38, 


5 


90 


2 


S73849 


MG220 homolog F10_ 


33 


42 


38. 


5 


133 


2 


B30902 


probable membrane 



34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 



42 
42 

42 
42 
42 
42 
42 
42 
42 
42 
42 
42 



38.5 
38.5 
38 .5 
38.5 
38.5 
38 .5 
38.5 
38.5 
38 .5 
38 .5 
38 .5 
38.5 



181 
226 
275 
305 
333 
522 
551 
565 
605 
629 
923 
1270 



1 
2 
2 
2 
2 
2 
2 
1 
2 
2 
2 
2 



TXSPM 

A86316 

T48696 

A56554 

H96814 

T28113 

S66701 

HMIVTN 

S46833 

JG0195 

S50482 

T22615 



thioredoxin m prec 
protein T10O22.3 [ 
mRNA splicing fact 
transcription fact 
hypothetical prote 
hypothetical prote 
probable membrane 
hemagglutinin prec 
hypothetical prote 
protein kinase DYR 
hypothetical prote 
hypothetical prote 



ALIGNMENTS 



RESULT 1 
S52820 

hypothetical protein YPR007c - yeast (Saccharomyces cerevisiae) 

N;Altemate names: hypothetical protein LPZ7c; hypothetical protein YP9723.07c 

C; Species: Saccharomyces cerevisiae 

C;Date: 19-May-1995 #sequence_revision Ol-Sep-1995 #text_change 09-Jul-2004 
C;Accession: S52820; S59752 
R; Pearson, D. ; Bowman, S. 

submitted to the EMBL Data Library, April 1995 
A;Reference number: S52814 

A, -Access ion : S52820 
A; Molecule type: DNA 
A;Residues: 1-680 <PEA> 

A; Cross -references: UNIPROT: Q12188 ; UNIPARC:UPI000006B335 ; EMBL:Z48951; 
NID:g762999; PID : g763006 ; MIPS:YPR007c 
A; Experimental source: strain AB972 

R;Wang, Y. ; Ahmed, A.; Bussey, H.; Fortin, N.; Friesen, J,D.; Hall, J.; Storms, 
R.K.; Vo, D.H.; Winnett, E. 

submitted to the EMBL Data Library, July 1995 

A /Description: The sequence of Saccharomyces cerevisiae chromosome XVI right 
arm. 

A;Reference number: S59746 
A; Access ion: S59752 
A; Molecule type: DNA 
A/Residues : 1-680 <WAN> 

A; Cross-references: UNIPARC:UPI000006B335; EMBL:U31900; NID: gl276597 ; 

PID:g939741; MIPS:YPR007c 

C; Genetics: 

A; Gene: SGD:SP069 

A; Cross-references : SGD : S0006211 

A;Map position: 16R 

Query Match 50.5%; Score 55; DB 2; Length 680; 

Best Local Similarity 47.6%; Pred. No. 2.1; 

Matches 10; Conservative 7; Mismatches 4; Indels 0; Gaps 0; 

Qy 3 SSVSSASSRSYDYSRRNVRKN 23 

h lh|:||::| I- I I 
Db . 43 9 STQSSSSTRSHEYGRKSFRNN 459 



Search completed: December 16, 2005, 23:53:14 
Job time : 25.8214 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



December 16, 2005, 23:33:02 ; Search time 147.036 Seconds 

(without alignments) 
110.362 Million cell updates/sec 

US-10-789-494B-3 

109 

1 AASSVSSASSRSYDYSRRNVRKN 23 
BLOSUM62 

Gapop 10.0 , Gapext 0,5 



Searched: 2166443 seqs, 705528306 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



2166443 



Post-processing: 



Database : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

UniProt_05.80:* 
1 : uniprot_sprot : * 
2 : iiniprot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 

Match Length DB 


ID 


Description 


1 


109 


100. 


.0 


5263 


1 


FIBH_BOMMO 


P05790 bombyx mori 


2 


55 


50, 


.5 


680 


2 


Q12188_YEAST 


Q1218 8 saccharomyc 


3 


50 


45. 


. 9 


271 


1 


RRFC SPIOL 


P82231 spinacia ol 


4 


50 


45. 


.9 


663 


1 


DUS8 MOUSE 


009112 mus musculu 


5 


50 


45. 


.9 


665 


2 


Q7TSZ9_MOUSE 


Q7tsz9 mus musculu 


6 


50 


45. 


.9 


1059 


2 


Q9VW51_DROME 


Q9vw51 drosophila 


7 


50 


45. 


.9 


1060 


2 


Q95VE6_DROME 


Q95ve6 drosophila 


8 


49 


45. 


.0 


158 


2 


Q8GPL9_METEX 


Q8gpl9 methyl obact 


9 


49 


45. 


, 0 


237 


2 


Q57W53_9TRYP 


Q57w53 trypanosoma 


10 


49 


45. 


,0 


341 


2 


Q54NX5_DICDI 


Q54nx5 dictyosteli 


11 


49 


45. 


,0 


351 


2 


Q6BL67_DEBHA 


Q6bl67 debaryomyce 



12 


49 


45 


.0 


383 


2 


Q07137__9IiACT 


Q07137 


lactococcus 


13 


49 


45, 


.0 


476 


2 


Q75 9N7_ASHG0 


Q759n7 


ashbya goss 


14 


49 


45. 


. 0 


820 


2 


Q4IP00_GIBZE 


Q4ip00 


gibberella 


15 


49 


45, 


.0 


1362 


2 


Q611V7_CAEBR 


Q611v7 


caenorhabdi 


16 


49 


45. 


.0 


1695 


2 


Q4P3I9_USTMA 


Q4p3i9 


ustilago ma 


17 


49 


45 . 


.0 


2303 


2 


Q4PF4 9 USTIVIA 


Q4pf49 


ustilago ma 


18 


48 


44 , 


. 0 


472 


2 


052491_AGRVI 


052491 


agrobacteri 


19 


48 


44 , 


.0 


472 


2 


Q9X5L2_AGRVI 


Q9x512 


agrobacteri 


20 


48 


44 , 


.0 


654 


2 


Q4Z078 PLABE 


Q4z078 


Plasmodium 


21 


48 


44 , 


.0 


679 


1 


UBC17 CAEEL 


Q11076 


caenorhabdi 


22 


48 


44 , 


.0 


1030 


2 


Q8H037_ORYSA 


Q8h037 


oryza sativ 


23 


48 


44 , 


.0 


1089 


2 


Q4QBW2_LEIMA 


Q4qbw2 


leishmania 


24 


47.5 


43, 


.6 


674 


2 


Q06629_YEAST 


Q06629 


saccharomyc 


25 


47 


43 , 


.1 


110 


2 


Q7TUE3_PROMP 


Q7tue3 


prochloroco 


26 


47 


43, 


. 1 


157 


2 


Q8GW40_ARATH 


Q8gw40 


arabidopsis 


27 


47 


43, 


.1 


306 


2 


Q99JI2_MOUSE 


Q99ji2 


mus musculu 


28 


47 


43. 


. 1 


393 


2 


Q8BNS4 MOUSE 


Q8bns4 


mus musculu 


29 


47 


43, 


.1 


434 


2 


Q4QAI2_LEIMA 


Q4qai2 


leishmania 


30 


47 


43, 


.1 


482 


2 


Q8BSN1_M0USE 


Q8bsnl 


mus musculu 


31 


47 


43, 


.1 


494 


2 


Q7RDC8 PLAYO 


Q7rdc8 


Plasmodium 


32 


47 


43, 


.1 


1041 


2 


023536_ARATH 


023536 


arabidopsis 


33 


47 


43 , 


.1 


1135 


2 


Q8S4Q0_ARATH 


Q8s4q0 


arabidopsis 


34 


47 


43. 


.1 


1147 


2 


Q8L843_ARATH 


Q81843 


arabidopsis 


35 


47 


43. 


.1 


1150 


2 


Q50PT4_ENTHI 


Q50pt4 


entamoeba h 


36 


47 


43. 


.1 


1232 


2 


Q8IEF5 PLAF7 


Q8ief5 


Plasmodium 


37 


47 


43. 


.1 


1256 


2 


023528_ARATH 


023528 


arabidopsis 


38 


47 


43. 


.1 


1267 


2 


Q54UP9_DICDI 


Q54up9 


dictyosteli 


39 


47 


43. 


. 1 


1526 


2 


Q7RFW3_PLAYO 


Q7rfw3 


Plasmodium 


40 


46.5 


42 . 


.1 


634 


2 


Q4FXZ9 LEIiVlA 


Q4fx2 9 


leishmania 


41 


46 


42 , 


,2 


265 


2 


Q5U926_9CALI 


Q5U926 


norovirus h 


42 


46 


42. 


.2 


371 


2 


Q8BZ78_MOUSE 


Q8bz78 


mus musculu 


43 


46 


42. 


.2 


386 


2 


054428_9IiACT 


054428 


lactococcus 


44 


46 


42. 


,2 


386 


2 


Q4 8610_9LACT 


Q48610 


lactococcus 


45 


46 


42. 


.2 


386 


2 


Q9FB63_LACLA 


Q9fb63 


lactococcus 



ALIGNMENTS 



RESULT 1 
FIBH_B0MM0 

ID FIBH_BOMMO STANDARD; PRT; 5263 AA. 

AC P05790; Q17220; Q26379; 

DT Ol-NOV-1988 (Rel . 09, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 13-SEP-2005 (Rel. 48, Last annotation update) 

DE Fibroin heavy chain precursor (Fib-H) (H- fibroin) . 

GN Name=FIBH; 

OS Bombyx mori (Silk moth) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea; 

OC Bomby c i da e ; Bombyx . 

OX NCBI_TaxID=7091 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RX MEDLINE=20330362; PubMed=10871375 ; DOI=10 . 1093/nar/28 . 12 . 2413 ; 
RA Zhou C.-Z., Gonfalonier i F. , Medina N., Zivanovic Y., Esnault C. , 



RA Yang T. , Jacquet M., Janin J., Duguet M., Perasso R., Li Z.-G.; 

RT "Fine organization of Bombyx mori fibroin heavy chain gene."; 

RL Nucleic Acids Res, 28:2413-2419(2000). 

RN [2] 

RP NUCLEOTIDE SEQUENCE OF 1-168. 

RX MEDLINE=80045039; PubMed=4 98286 ; DOI = 10 . 1016/0092-8674 (79) 90075-8 ; 

RA Tsujimoto Y., Suzuki Y. ; 

RT "The DNA sequence of Bombyx mori fibroin gene including the 5* 

RT flanking, mRNA coding, entire intervening and fibroin protein coding 

RT regions . " ; 

RL Cell 18:591-600(1979). 

RN [3] 

RP PARTIAL NUCLEOTIDE SEQUENCE. 

RX MEDLINE=79211211; PubMed=455439 ; DOI=10 . 1016/0092 -8674 (79) 90018-7 ; 

RA Tsuj imoto Y . , Suzuki Y . ; 

RT "Structural analysis of the fibroin gene at the 5* end and its 

RT surrounding regions . " ; 

RL Cell 16:425-436(1979). 

RN [4] 

RP PARTIAL NUCLEOTIDE SEQUENCE. 

RC STRAIN=Kinshu X Showa; 

RX MEDLINE=89094868 ; Pub!yied=3210244 ; 

RA Mita K. , Ichimura S., Zama M., James T.C.; 

RT "Specific codon usage pattern and its implications on the secondary 

RT structure of silk fibroin mRNA."; 

RL J. Mol. Biol. 203:917-925(1988). 

RN [5] 

RP PARTIAL NUCLEOTIDE SEQUENCE. 

RX MEDLINE=94365842; PubMed=7916056 ; DOI =10 . 1007/BF00175878 ; 

RA Mita K. , Ichimura S., James T.C.; 

RT "Highly repetitive structure and its organization of the silk fibroin 

RT gene . " ; 

RL J. Mol. Evol. 38:583-592(1994). 

RN [6] 

RP NUCLEOTIDE SEQUENCE OF 5179-5263, AND DISULFIDE BONDS. 

RC STRAIN=J-13 9; 

RX MEDLINE=99296390; PubMed=10366732 ; DOI =10 . 1016/S0167-4838 ( 99 ) 00088 -6 ; 

RA Tanaka K. , Kajiyama N., Ishikura K, , Waga S., Kikuchi A., Ohtomo K. , 

RA Takagi T., Mizuno S.; 

RT "Determination of the site of disulfide linkage between heavy and 

RT light chains of silk fibroin produced by Bombyx mori . " ; 

RL Biochim. Biophys . Acta 1432:92-103(1999). 

RN [7] 

RP PARTIAL NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RC STRAIN=Daizo; 

RX PubMed=:15591204; DOI = 10 . 1126/science . 1102210; 

RA Xia Q., Zhou Z., Lu C. , Cheng D., Dai F., Li B., Zhao P., Zha X., 

RA Cheng T. , Chai C. , Pan G. , Xu J., Liu C. , Lin Y. , Qian J., Hou Y. , 

RA Wu Z. , Li G. , Pan M. , Li C. , Shen Y. , Lan X. , Yuan L. , Li T. , Xu H. , 

RA Yang G. , Wan Y. , Zhu Y. , Yu M. , Shen W. , Wu D. , Xiang Z, , Yu J., 

RA Wang J., Li R. , Shi J., Li H., Li G. , Su J. , Wang X., Li G. , Zhang Z., 

RA Wu Q. , Li J., Zhang Q., Wei N., Xu J. , Sun H. , Dong L., Liu D., 

RA Zhao S., Zhao X., Meng Q. , Lan F., Huang X., Li Y. , Fang L. , Li C. , 

RA Li D., Sun Y. , Zhang Z. , Yang Z., Huang Y., Xi Y., Qi Q. , He D., 

RA Huang H. , Zhang X., Wang Z., Li W. , Cao Y. , Yu Y., Yu H., Li J. , 

RA Ye J., Chen H, , Zhou Y. , Liu B., Wang J., Ye J., Ji H., Li S., Ni P., 

RA Zhang J., Zhang Y., Zheng H., Mao B., Wang W. , Ye C, Li S., Wang J., 



Wong G.K. -S. , Yang H. ; 

"A draft sequence for the genome of the domesticated silkworm (Bombyx 
mori) . " ; 

Science 306:1937-1940(2004). 
[8] 

SUBUNIT. 

MEDLINE=20568317; PubMed=10986287 ; DOI =10 . 1074/ jbc . M006897200 ; 
Inoue S., Tanaka K. , Arisaka F,, Kimura S., Ohtomo K. , Mizuno S.; 
"Silk fibroin of Bombyx mori is secreted, assembling a high molecular 
mass elementary unit consisting of H-chain, L-chain, and p25, with a 
6:6:1 molar ratio."; 

J. Biol. Chem. 275:40517-40528(2000). 

-!- FUNCTION: Core component of the silk filament; a strong, insoluble 

and chemically inert fiber. 
-!- SUBUNIT: Silk fibroin elementary unit consists in a disulfide- 

linked heavy and light chain and a p25 glycoprotein in molar 

ratios of 6:6:1. This results in a complex of approximately 2.3 

MDa. 

-!- TISSUE SPECIFICITY: Produced exclusively in the posterior (PSG) 
section of silk glands, which are essentially modified salivary 

glands . 

DOMAIN: Composed of antiparallel beta sheets. The strands of the 
beta sheets run parallel to the fiber axis. Long stretches of silk 
fibroin are composed of microcrystalline arrays of ( -Gly-Ser-Gly- 
Ala-Gly-Ala-) n interrupted by regions containing bulkier residues. 
The fiber is composed of microcrystalline arrays alternating with 
amorphous regions . 
-I- PTM: The interchain disulfide bridge is essential for the 
intracellular transport and secretion of fibroin. 



This Swiss-Prot entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use as long as its content is in no way modified and this statement is not 
removed . 



EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 



AF226688; AAF76983.1; 
V00094; CAA23432.1 
CAA23433 
AAB31861 
CAA32076 



V00097; 
S74439; 
X13869; 



- ; Genomic_DNA . 
Genomic_DNA. 
Genomic_DNA. 
mRNA. 
mRNA, 
mRNA. 

- ; Genomic_DNA . 
CK538369; -; NOT_ANNOTATED_CDS ; mRNA. 
AADK01000575; -; NOT_ANNOTATED_CDS ; Genomic_DNA. 
PIR; S01844; S01844 . 
Repeat; Signal; Silk. 



M35378; AAA27839 



AB017362; BAA33147.1; 



SIGNAL 

CHAIN 

REGION 

DISULFID 

DISULFID 

CONFLICT 

SEQUENCE 



1 
22 
149 
5244 
5260 
10 



21 
5263 
5206 
5244 
5263 

10 



5263 AA; 391593 



Potential . 

Fibroin heavy chain. 
Highly repetitive. 

Interchain (with C-190 in light chain) . 

C -> V (in Ref . 2) . 
MW; 8EE11D3A0A47440E CRC64 ; 



Query Match 100.0%; Score 109; DB 1; Length 5263; 

Best Local Similarity 100.0%; Pred. No. 5.3e-07; 



Matches 23; Conservative 0; Mismatches 

Qy 1 A?VSSVSSASSRSYDYSRRNVRKN 23 

IIIIMIIIIIIIIIIIIIIIII 
Db 5221 AASSVSSASSRSYDYSRRNVRKN 5243 



Search completed: December 16, 2005, 23:52:12 
Job time : 150.036 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



December 16, 2005, 23:08:17 ; Search time 125 Seconds 

(without alignments) 

70.301 Million cell updates/sec 

US-10-789-494B-7 
108 

1 VETIVLEEDPYGHEDIYEED 20 



2443163 



Scoring table: BLOSUiyi62 

Gapop 10.0 , Gapext 0.5 

Searched: 2443163 seqs, 439378781 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : A_Geneseq_21 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6 : geneseqp2003as : * 

7 : geneseqp2003bs : * 

8: geneseqp2004s : * 

9: geneseqp2005s:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB ID 



Description 



1 


108 


100 


.0 


20 


8 


ADU51237 


Adu51237 


2 


108 


100 


.0 


20 


8 


ADU51211 


Adu51211 


3 


108 


100 


0 


120 


8 


ADU51171 


Adu51171 


4 


108 


100 


0 


2655 


7 


AD059401 


Ado59401 


5 


50 


46 


3 


1100 


5 


ABB50120 


Abbs 0120 


6 


49 


45 


4 


299 


9 


ABM93718 


Abm93718 


7 


49 


45 


4 


325 


3 


AAB07672 


Aab07672 


8 


49 


45 


4 


325 


8 


ADU66520 


Adu66520 



Gut silkw 
Silkworm 
Gut silkw 
Antheraea 
Listeria 
M . xanthu 
Amino aci 
TaD, memb 



9 


49 


45 


.4 


325 


9 


ADZ44577 


Adz44577 


Myxococcu 


10 


49 


45 


.4 


640 


6 


ABP71659 


Abp71659 


A. cellul 


11 


49 


45 


.4 


640 


7 


ADF75899 


Adf75899 


Catalytic 


12 


49 


45 


.4 


1121 


6 


ABP71656 


Abp71656 


A. cellul 


13 


49 


45 


.4 


1121 


7 


ADF75895 


Adf 75895 


Acidother 


14 


48 ,5 


44 


.9 


201 


7 


ADM2 542 6 


Adm25426 


Hyperther 


15 


48 


44 


.4 


55 


5 


ABG80758 


Abg80758 


C-tvpe le 


16 


48 


44 


.4 


159 


5 


ABB56404 


Abb56404 


Human col 


17 


48 


44 


.4 


277 


2 


AAY25518 


Aay25518 


Human col 


18 


48 


44 


.4 


277 


2 


AAY41698 


Aay41698 


Human PRO 


19 


48 


44 


.4 


277 


3 


AAB44254 


Aab44254 


Human PRO 


20 


48 


44 


.4 


277 


4 


AAU29073 


Aau29073 


Human PRO 


21 


48 


44 


.4 


277 


6 


ABU58449 


Abu58449 


Human PRO 


22 


48 


44 


.4 


277 


6 


ABU87997 


Abu87997 


Novel hum 


23 


48 


44 


.4 


277 


6 


ABU84312 


Abu84312 


Human sec 


24 


48 


44 


.4 


277 


6 


ABR66186 


Abr66186 


Human sec 


25 


48 


44 


.4 


277 


6 


ABR65576 


Abr65576 


Human sec 


26 


48 


44 


.4 


277 


6 


ABU99516 


Abu99516 


Human sec 


27 


48 


44, 


.4 


277 


6 


ABU82755 


Abu82755 


Human PRO 


28 


48 


44. 


.4 


277 


6 


ABU8 9876 


Abu89876 


Novel hum 


29 


48 


44, 


.4 


277 


6 


ABR68125 


Abr68125 


Human sec 


30 


48 


44 , 


.4 


277 


6 


ABU96178 


Abu96178 


Novel hum 


31 


48 


44 , 


.4 


277 


6 


ABU92609 


Abu92609 


Human sec 


32 


48 


44. 


.4 


277 


6 


ABO08686 


Abo08686 


Human sec 


33 


48 


44 . 


.4 


277 


6 


ABO02738 


Abo02738 


Human sec 


34 


48 


44 . 


.4 


277 


6 


ABR748 92 


Abr74892 


Human sec 


35 


48 


44. 


.4 


277 


6 


ABR94654 


Abr94654 


Human sec 


36 


48 


44. 


.4 


277 


6 


ABO252 00 


Abo25200 


Novel hum 


37 


48 


44 . 


.4 


277 


6 


ABU85627 


Abu85627 


Human PRO 


38 


48 


44 . 


.4 


277 


6 


ABU98787 


Abu98787 


Novel hum 


39 


48 


44. 


,4 


277 


6 


ABU98002 


Abu98002 


Novel hum 


40 


48 


44 . 


.4 


277 


6 


ABU91708 


Abu91708 


Novel hum 


41 


48 


44 . 


.4 


277 


6 


ABU72206 


Abu72206 


Novel hum 


42 


48 


44 . 


4 


277 


6 


ABU8 94 01 


Abu894 01 


Human PRO 


43 


48 


44. 


4 


277 


6 


ABU86242 


Abu86242 


Human sec 


44 


48 


44 . 


4 


277 


6 


ABU674 55 


Abu67455 


Human sec 


45 


48 


44. 


4 


277 


6 


ABU8 0483 


Abu8 0483 


Human PRO 



ALIGNMENTS 



RESULT 1 
ADU51237 

ID ADU51237 standard; peptide; 20 AA. 
XX 

AC ADU51237; 
XX 

DT 24-FEB-2005 (first entry) 
XX 

DE Gut silkworm fibroin peptide fragment 39, 
XX 

KW vulnerary; cell proliferation; wound healing; cell adhesion; cosmetics; 

KW cell culture; fibroin. 

XX 

OS Bombycoidea . 
XX 



PN JP2004339189-A. 
XX 

PD 02-DEC-2004. 
XX 

PF 04-DEC-2003; 2003 JP-004 06608 . 

XX 

PR 28-FEB-2003; 2003 JP-00055048 . 
XX 

PA (DOKU-) DOKURITSU GYOSEI HOJIN NOGYO SEIBUTSU SH. 

PA (TSUB/) TSUBOUCHI K. 
XX 

DR WPI; 2004-827614/82. 
XX 

PT New peptide having excellent cell growth promoting activity, for use as a 

PT cell growth promoter, cell adhesion agent, wound healing -promo ting agent, 

PT cosmetic and cell culture base material. 
XX 

PS Example 3; Page; 27pp; Japanese. 
XX 

CC The invention relates to a novel peptide having excellent cell growth 

CC promoting activity. The peptide of the invention demonstrates vulnerary 

CC activity and may be utilised as a cell growth promoter, cell adhesion 

CC agent, wound healing -promoting agent or cosmetic and cell culture base 

CC material. The current sequence is that of a gut silkworm fibroin peptide 

CC fragment of the invention which is described as being amorphous. 
XX 

SQ Sequence 20 AA; 

Query Match 100.0%; Score 108; DB 8; Length 20; 

Best Local Similarity 100.0%; Pred. No. 3.6e-10; 

Matches 20; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 VETIVLEEDPYGHEDIYEED 20 

llllllllllllllllilll 
Db 1 VETIVLEEDPYGHEDIYEED 20 



RESULT 4 
ADO59401 

ID ADO59401 standard; protein; 2655 AA. 
XX 

AC ADOS 9401; 
XX 

DT 15-JUL-2004 (first entry) 
XX 

DE Antheraea yamamai fibroin. 
XX 

KW Japanese oak silkmoth; fibroin; silkworm; raw silk; thread. 
XX 

OS Antheraea yamamai. 
XX 

PN KR2002094304-A. 
XX 

PD 18-DEC-2002. 
XX 

PF 08-JUN-2001; 2001KR-00031954 . 



XX 

PR 08-JUN-2001; 2 OOIKR- 0003 1954 . 
XX 

PA (RURA-) RURAL DEV ADMINISTRATION. 
XX 

PI Hwang JS, Jin BR, Kang SU, Kim GY, Kim YS, Koo TW, Lee JS; 

PI Lee SM, Seo DS, Yoon EY; 

XX 

DR WPI; 2003-399638/38. 

DR N-PSDB; ADO59400. 
XX 

PT Nucleotide sequence encoding fibroin useful for transforming silkworms 

PT for producing raw silk thread comprises a gene isolated from Antheraea 

PT yamamai and its promoter. 
XX 

PS Example 1; SEQ ID NO 2; 19pp; Korean. 

XX 

CC This invention describes a novel nucleic acid isolated from Antheraea 

CC yamamai (Japanese oak silkmoth) which encodes fibroin. The nucleic acid 

CC is useful for producing transformed silkworms capable of producing raw 

CC silk-thread. 
XX 

SQ Sequence 2655 AA; 



Query Match 100.0%; Score 108; DB 7; Length 2655; 

Best Local Similarity 100.0%; Pred. No. l,3e-07; 

Matches 20 ; Conservative 0 ; Mismatches 0 ; Indels 0 ; Gaps 0 ; 



Qy 1 VETIVLEEDPYGHEDIYEED 2 0 

Illlllllllllllllllll 
Db 61 VETIVLEEDPYGHEDIYEED 80 



RESULT 5 
ABB50120 

ID ABB50120 Standard; protein; 1100 AA. 
XX 

AC ABBS 0120; 
XX 

DT 05-FEB-2002 (first entry) 
XX 

DE Listeria monocytogenes protein #2824. 
XX 

KW Antibacterial; gene therapy; vaccine; biosynthesis; biodegradation; 

KW vitamin B12; bacterial infection; disease. 

XX 

OS Listeria monocytogenes. 
XX 

PN WO200177335-A2. 
XX 

PD 18-OCT-2001. 
XX 

PF ll-APR-2001; 2001WO-FR001118 . 
XX 

PR ll-APR-2000; 2000FR-00004629 . 
XX 

PA (INSP ) INST PASTEUR. 



XX 

PI Buchrieser C, Frangeul L, Couve E, Rusniok C, Fsihi H, Dehoux P; 

PI Dussurget 0, Chetouani F, Nedjari H, Glaser P, Kunst F, Cossart P; 

PI Daniels J, Goebel W, Kreft J, Kuhn M, Ng E, Vazquez -Boland JA; 

PI Dominguez-Bernal G, Garrido-Garcia P, Tierrez -Martinez A, Amend A; 

PI Chakraborty T, Domann E, Hain T, Berche P, Charbit A, Durant L; 

PI Perez-Diaz J, Baquero F, Garcia Del Portillo F, Gomez-Lopez N; 

PI Maduenio E, De Pablos B, Wehland J, Kaerst U, Entian K, Hauf J; 

PI Rose M, Voss H; 
XX 

DR WPI; 2002-010914/01. 
XX 

PT Genomic sequence for Listeria monocytogenes, useful e.g, for treatment 

PT and prevention of Listeria and related bacterial infections, and related 

PT polypeptides. 
XX 

PS Claim 6; SEQ ID NO 2825; 192pp; French. 
XX 

CC The present invention relates to the genome sequence of Listeria 

CC monocytogenes EGD-e (see ABA03041) . The genome sequence and fragments of 

CC it are useful for selecting probes and primers for detecting genes in L. 

CC monocytogenes and related organisms, and for studying genetic 

CC polymorphisms and other genomes. The present sequence is a protein 

CC encoded by the genome sequence of the present invention. Proteins 

CC expressed from the genome sequence are useful for raising specific 

CC antibodies, identification of L. monocytogenes and related organisms, and 

CC for biosynthesis and biodegradation, especially biosynthesis of Vitamin 

CC B12 . The genome sequence and proteins encoded by it are also useful for 

CC selecting compounds that regulate gene expression and cell replication 

CC and modulate L. monocytogenes -related diseases. In addition, the genome 

CC sequence and proteins encoded by it are useful in pharmaceutical and 

CC vaccines compositions for the treatment or prevention of infections by L. 

CC monocytogenes and related organisms. Note: The sequence data for this 

CC patent did not form part of the printed specification, but was obtained 

CC in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/published__pct_sequences 

XX 

SQ Sequence 1100 AA; 

Query Match 46.3%; Score 50; DB 5; Length 1100; 

Best Local Similarity 47.1%; Pred. No. 97; 

Matches 8; Conservative 4; Mismatches 5; Indels 0; Gaps 0; 

Qy 3 TIVLEEDPYGHEDIYEE 19 

I : Ihllh Ih 
Db 242 TAHISHDPFGHENRYEQ 258 



Search completed: December 16, 2005, 23:46:10 
Job time : 127 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Rion on: December 16, 2005, 23:40:43 ; Search time 30.3571 Seconds 

(without alignments) 
54.469 Million cell updates/sec 

Title: US-10-78 9-4 94B-7 

Perfect score: 108 

Sequence: 1 VETIVLEEDPYGHEDIYEED 20 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 572060 seqs, 82675679 residues 

Total number of hits satisfying chosen parameters: 572060 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/l/iaa/5_COMB . pep : * 

2 : /cgn2_6/ptodata/l/iaa/6_COMB.pep: * 

3 : /cgn2_6/ptodata/l/iaa/H_COMB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/PCTUS_COMB.pep: * 

5 : /cgn2_6/ptodata/l/iaa/RE_COMB.pep: * 

6 : /cgn2_6/ptodata/l/iaa/backf ilesl .pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


49 


45 


.4 


299 


2 


US- 


-09 


-902 


-540-12917 


Sequence 


12917, A 


2 


49 


45 


.4 


325 


2 


US- 


-09 


-710 


-262E-9 


Sequence 


9, Appli 


3 


48 


44 


.4 


277 


2 


US- 


-09 


-600 


-932-2 


Sequence 


2, Appli 


4 


48 


44 


4 


277 


2 


us- 


-09 


-999 


-833A-97 


Sequence 


97, Appl 


5 


48 


44 


4 


277 


2 


us- 


■10 


-020 


-445A-97 


Sequence 


97, Appl 


6 


47 


43 


5 


562 


2 


us- 


-09 


-949 


-002-426 


Sequence 


426, App 


7 


47 


43 


5 


636 


2 


us- 


09 


-564 


-805-237 


Sequence 


237, App 


8 


47 


43 


5 


726 


2 


us- 


09 


-187 


-330-57 


Sequence 


57, Appl 


9 


47 


43 


5 


781 


2 


us- 


09 


-187 


-330-32 


Sequence 


32, Appl 


10 


47 


43 


5 


787 


2 


us- 


09 


-187 


-330-31 


Sequence 


31, Appl 


11 


47 


43 


5 


800 


2 


us- 


09 


-187 


-330-41 


Sequence 


41, Appl 


12 


47 


43, 


5 


806 


2 


us- 


09 


-187 


-330-3 


Sequence 


3, Appli 


13 


47 


43. 


5 


828 


2 


us- 


09 


-187- 


-330-55 


Sequence 


55, Appl 


14 


47 


43. 


5 


874 


2 


us- 


09 


-187- 


-330-59 


Sequence 


59, Appl 


15 


47 


43. 


5 


937 


1 


us- 


08- 


-253- 


-155A-31 


Sequence 


31, Appl 


16 


47 


43. 


5 


937 


2 


us- 


09- 


-538- 


'092-1092 


Sequence 


1092, Ap 



17 


47 


43 


5 


937 


2 


US- 


09 


-949 


-002-374 


Sequence 


374, App 


18 


47 


43 


5 


959 


2 


us- 


09 


-538 


-092-1091 


Sequence 


1091, Ap 


19 


47 


43 


5 


1000 


2 


us- 


09 


-187 


-330-1 


Sequence 


1, Appli 


20 


47 


43 


5 


1102 


2 


us- 


09 


-364 


-609-8 


Sequence 


8, Appli 


21 


46.5 


43 


1 


289 


2 


us- 


09 


-902 


-540-10736 


Sequence 


10736, A 


22 


46 


42 


6 


276 


2 


us- 


09 


-902 


-540-16693 


Sequence 


16693, A 


23 


45 


41 


7 


512 


2 


us- 


09 


-949 


-016-10977 


Sequence 


10977, A 


24 


45 


41 


7 


513 


1 


us- 


08 


-357 


-533A-11 


Sequence 


11, Appl 


25 


45 


41 


7 


513 


1 


us- 


08 


-459 


-009-11 


Sequence 


11, Appl 


26 


45 


41 


7 


513 


2 


us- 


08 


-459 


-951-11 


Sequence 


11, Appl 


27 


45 


41 


7 


799 


2 


us- 


08 


-909 


-954-4 


Sequence 


4, Appli 


28 


45 


41 


7 


804 


2 


us- 


08 


-909 


-954-2 


Sequence 


2, Appli 


29 


44 


40 


7 


331 


2 


us- 


09 


-248 


-796A-18164 


Sequence 


18164, A 


30 


44 


40 


7 


357 


1 


us- 


08 


-978 


-182-4 


Sequence 


4, Appli 


31 


44 


40 


7 


357 


1 


us- 


09 


-205 


-681-4 


Sequence 


4, Appli 


32 


44 


40 


7 


373 


2 


us- 


09 


-248 


-796A-20443 


Sequence 


20443, A 


33 


44 


40 


7 


850 


2 


us- 


09 


-583 


-110-4394 


Sequence 


4394, Ap 


34 


44 


40 


7 


861 


2 


us- 


09 


-107 


-433-2860 


Sequence 


2860, Ap 


35 


43 


39 


8 


238 


2 


us- 


09 


-489 


-039A-9218 


Sequence 


9218, Ap 


36 


43 


39 


8 


302 


2 


us- 


09 


-438 


-185A-457 


Sequence 


4 57, App 


37 


43 


39 


8 


324 


2 


us- 


09 


-248 


-796A-15787 


Sequence 


15787, A 


38 


43 


39 


8 


342 


2 


us- 


09 


-248 


-796A-14202 


Sequence 


14202, A 


39 


43 


39 


8 


357 


2 


us- 


09 


-198 


-452A-487 


Sequence 


487, App 


40 


43 


39 


8 


455 


2 


us- 


09 


-543 


-681A-8288 


Sequence 


8288, Ap 


41 


43 


39 


8 


536 


1 


us- 


08 


-357 


-533A-12 


Sequence 


12, Appl 


42 


43 


39 


8 


536 


1 


us- 


08 


-459 


-009-12 


Sequence 


12, Appl 


43 


43 


39 


8 


536 


2 


us- 


08 


-459 


-951-12 


Sequence 


12, Appl 


44 


43 


39 


8 


536 


2 


us- 


09 


-267 


-963D-35 


Sequence 


35, Appl 


45 


43 


39 


8 


817 


1 


us- 


08 


-381 


-931B-2 


Sequence 


2, Appli 



ALIGNMENTS 



RESULT 1 

US-09-902-540-12917 

; Sequence 12917, Application US/09902540 

; Patent No. 6833447 

; GENERAL INFORMATION: 

; APPLICANT: Goldman, Barry S. 

; APPLICANT: Hinkle, Gregory J. 

; APPLICANT: Slater, Steven C. 

; APPLICANT: Wiegand, Roger C. 

TITLE OF INVENTION: Myxococcus xanthus Genome Sequences and Uses Thereof 
; FILE REFERENCE: 38 - 1 0 ( 1584 9 ) B 
; CURRENT APPLICATION NUMBER: US/09/902,54 0 
; CURRENT FILING DATE: 2001-07-10 
; PRIOR APPLICATION NUMBER: 60/217,883 

PRIOR FILING DATE: 2000-07-10 
; NUMBER OF SEQ ID NOS : 16825 
; SEQ ID NO 12917 
LENGTH: 2 99 
TYPE : PRT 

ORGANISM: Myxococcus xanthus 
US-09-902-540-12917 



Query Match 



45.4%; Score 49; DB 2; Length 299; 



Best Local Similarity 53.8%; Pred. No. 7.2; 

Matches 7; Conservative 4; Mismatches 2; Indels 0; Gaps 0; 

Qy 7 EEDPYGHEDIYEE 19 

I I :|lh:|h 
Db 95 EADAFGHEELYEQ 107 



Search completed: December 17, 2005, 00:12:07 
Job time : 30.3571 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 



December 16, 2005, 23:52:28 ; Search time 105.714 Seconds 

{ wi t hou t a 1 ignmen t s ) 
79.049 Million cell updates/sec 

US-10-789-494B-7 

108 

1 VETIVLEEDPYGHEDIYEED 20 



BL0SUM62 
Gapop 10.0 



Gapext 0.5 
1867569 seqs, 417829326 residues 



1867569 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applications_AA_Main: * 

1: /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep:* 

2 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB .pep : * 

3 : /cgn2_6/ptodata/l/pubpaa/US09_PUBCOMB.pep: * 

4 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep: * 

5 : /cgn2_6 /ptodata/ 1 /pubpaa /US 1 0B_PUBCOMB . pep : * 

6 : /cgn2_6/ptodata/l/pubpaa/USll_PUBCOMB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB ID 



Description 



. 1 


108 


100 


. 0 


20 


5 


US-10-789-494B-7 


Sequence 


7 , Appl i 


2 


108 


100 


. 0 


20 


5 


US-10'789-494B-67 


Sequence 


67, 


Appl 


3 


108 


100 


. 0 


120 


5 


US-10-78 9-494B-23 


Sequence 


23, 


Appl 


4 


53 


49 


. 1 


127 


4 


US- 10-424 -599 -203223 


Sequence 


203223, 


5 


51 


47 


. 2 


113 


4 


US -10-424-599-20098 8 


Sequence 


200988, 


6 


49 


45 


. 4 


325 


5 


US-10-848-111-9 


Sequence 


9, Appli 


7 


49 


45 


- 4 


554 


4 


US-10 -424 -599-18 0031 


Sequence 


180031, 


8 


48 


44 


. 4 


159 


4 


US-10-258-105-14 


Sequence 


14, 


Appl 


9 


48 


44 


. 4 


277 


3 


US-09-978-295A-97 


Sequence 


97, 


Appl 


10 


48 


44 


. 4 


277 


3 


US-09-978-697-97 


Sequence 


97, 


Appl 


11 


48 


44 


. 4 


277 


3 


US- 09-978 -192A-97 


Sequence 


97, 


Appl 


12 


48 


44 


. 4 


277 


3 


US-09-999-832A-97 


Sequence 


97, 


Appl 


13 


48 


44 


. 4 


277 


3 


US-09-978-189-97 


Sequence 


97, 


Appl 


14 


48 


'44 


. 4 


277 


3 


US-09-978-608A-97 


Sequence 


97, 


Appl 


15 


48 


44 


. 4 


277 


3 


US-09-978-585A-97 


Sequence 


97, 


Appl 


16 


48 


44 


. 4 


277 


3 


US-09-978-191A-97 


Sequence 


97, 


Appl 


17 


48 


44 


. 4 


277 


3 


US-09-978 -4 03A-97 


Sequence 


97, 


Appl 


18 


48 


44 


. 4 


277 


3 


US-09-978-564A-97 


Sequence 


97, 


Appl 


19 


48 


44 


. 4 


277 


3 


US-09-999-833A-97 


Sequence 


97, 


Appl 


20 


48 


44 


. 4 


277 


3 


US-09-981-915A-97 


Sequence 


97, 


Appl 


2 1 


48 


44 


. 4 


277 


3 


US -09-978-824 -97 


Sequence 


97, 


Appl 


22 


48 


44 


. 4 


277 


3 


US-09-918-585A-97 


Sequence 


97, 


Appl 


23 


48 


44 


. 4 


277 


3 


US-09-999-834A-97 


Sequence 


97, 


Appl 


24 


48 


44 


, 4 


277 


3 


US-09-978-423A-97 


Sequence 


97, 


Appl 


25 


48 


44 


. 4 


277 


3 


US-09-978-193A-97 


Sequence 


97, 


Appl 


26 


48 


44 


. 4 


277 


3 


US-09- 999-83 OA- 97 


Sequence 


97, 


Appl 


27 


48 


44 


. 4 


277 


3 


US~09-978 -757A-97 


Sequence 


97, 


Appl 


28 


48 


44 


4 


277 


3 


US -09-978 -18 7B-97 


Sequence 


97, 


Appl 


29 


48 


44 


4 


277 


3 


US- 09 -978- 64 3A- 97 


Sequence 


97, 


Appl 


30 


48 


44 


4 


277 


3 


US-09-978-375A-97 


Sequence 


97, 


Appl 


31 


48 


44 


4 


277 


3 


US-09-978-298A-97 


Sequence 


97, 


Appl 


32 


48 


44 


4 


277 


3 


US-09-978-188A-97 


Sequence 


97, 


Appl 


33 


48 


44 


4 


277 


3 


US-09-978-681A-97 


Sequence 


97, 


Appl 


34 


48 


44 


4 


277 


3 


US-09-978-194A-97 


Sequence 


97, 


Appl 


35 


48 


44 


4 


277 


3 


US- 09- 999-82 9A- 97 


Sequence 


97, 


Appl 


36 


48 


44 


4 


277 


3 


US-09-978-2 99A-97 


Sequence 


97, 


Appl 


37 


48 


44 


4 


277 


3 


US-09-978-544A-97 


Sequence 


97, 


Appl 


38 


48 


44 


4 


277 


3 


US-09-978-665A-97 


Sequence 


97, 


Appl 


T Q 

o y 


A Q 
4 O 


A A 


4 


Z I I 


3 


T TO AO QTO O A 7\ AT 

Ub - 09 - 978 -0 02 A- 97 


Sequence 


97, 


Appl 


40 


48 


44 


4 


277 


3 


US-09-999-831A-97 


Sequence 


97, 


Appl 


41 


48 


44, 


4 


211 


3 


US-09-978-824-97 


Sequence 


97, 


Appl 


42 


48 


44, 


4 


277 


4 


US-10-052-586-100 


Sequence 


100, 


App 


43 


48 


44 . 


4 


277 


4 


US-10-174-590-100 


Sequence 


100, 


App 


44 


48 


44 . 


4 


277 


4 


US-10-176-758-100 


Sequence 


100, 


App 


45 


48 


44 . 


4 


277 


4 


US-10-175-737-100 


Sequence 


100, 


App 



ALIGNMENTS 



RESULT 1 

US-10-789-494B-7 

; Sequence 7, Application US/10789494B 

; Publication No. US20050143296A1 

; GENERAL INFORMATION: 

; APPLICANT: TSUBOUCHI , Kozo 



; APPLICANT: YAMADA, Hiromi 

; TITLE OF INVENTION: EXTRACTION AND UTILIZATION OF CELL 

; TITLE OF INVENTION: GROWTH- PROMOTING PEPTIDES FROM SILK PROTEIN 

; FILE REFERENCE: OPS 635 

; CURRENT APPLICATION NUMBER: US/lO/789 , 494B 

; CURRENT FILING DATE: 2 004-02-27 

; PRIOR APPLICATION NUMBER: JP 2003-55048 

; PRIOR FILING DATE: 2003-02-28 

; NUMBER OF SEQ ID NOS : 85 

; SEQ ID NO 7 

LENGTH: 20 

TYPE: PRT 

ORGANISM: Antheraea yamamai 
US-10-789-494B-7 



Query Match 100.0%; Score 108; DB 5; Length 20; 

Best Local Similarity 100.0%; Pred. No. 2.7e-09; 

Matches 20; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 VETIVLEEDPYGHEDIYEED 20 

llllllllllllllllllll 
Db 1 VETIVLEEDPYGHEDIYEED 20 



RESULT 4 

US-10-424 -599-203223 

Sequence 203223, Application US/10424599 
Publication No. US20040031072A1 
GENERAL INFORMATION: 



La Rosa Thomas J 
Kovalic David K 
Zhou Yihua 
Cao Yongwei 

Soy Nucleic Acid Molecules and Other Molecules Associated 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF INVENTION 
With 

TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement 
FILE REFERENCE: 38 -21 (53223 ) B 
CURRENT APPLICATION NUMBER: US/ 1 0/424 , 599 
CURRENT FILING DATE: 2 003-04-28 
NUMBER OF SEQ ID NOS: 285684 
SEQ ID NO 203223 
LENGTH: 127 
TYPE: PRT 

ORGANISM: Glycine max 
FEATURE : 

OTHER INFORMATION: Clone ID: PAT_MRT3847_25535C . 1 
US-10-424 -599-203223 



.pep 



Query Match 49.1%; 
Best Local Similarity 50.0%; 
Matches 10; Conservative 



Score 53; DB 4; 
Pred . No . 4.2; 
5; Mismatches 



Qy 

Db 



3 TIVLEED--PYGHEDIYEED 20 

:::| II I Ih HIM 
108 SVILNEDSRPCGHKCLYEED 127 



Length 127; 
3; Indels 2; Gaps 



1; 



Search completed: December 17, 2005, 00:17:33 
Job time : 106.714 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 



December 16, 2005, 23:53:24 ; Search time 5.35714 Seconds 

{without alignments) 
25.151 Million cell updates/sec 

US-10-789-494B-7 

108 

1 VETIVLEEDPYGHEDIYEED 20 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



51470 seqs, 6736768 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



51470 



Post -processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA_New: * 

1 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep: * 

2 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB .pep : * 

3 : /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB .pep : * 

4 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB.pep: * 

5 : /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB .pep : * 

6 : /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB .pep : * 

7 : / cgn2_6 /p t oda t a / 1 /pubpaa /US 1 1_NEW_PUB . pep : * 

8 : /cgn2_6/ptodata/l/pubpaa/US60__NEW_PUB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 



Score 



Query 

Match Length DB 



ID 



Description 



1 


45 


41 


7 


98 


6 


US' 


10 


-467 


-657-3490 


Sequence 


3490, Ap 


2 


43 


39 


8 


963 


6 


us- 


10 


-467 


-962B-2 


Sequence 


2, Appli 


3 


42 


38 


9 


459 


7 


us- 


11 


-186 


-284-12 


Sequence 


12, Appl 


4 


42 


38 


9 


2647 


6 


us- 


10 


-821 


-234-1303 


Sequence 


1303, Ap 


5 


40 


37 


0 


21 


6 


us- 


10 


-939 


-890-584 


Sequence 


584, App 



6 


40 


37 


.0 


331 


6 


US- 


10 


-995 


-561-977 


Sequence 


977, App 


7 


40 


37 


.0 


331 


6 


US- 


10 


-995 


-561-978 


Sequence 


978, App 


8 


40 


37 


.0 


3655 


7 


US- 


11 


-075 


-185-5 


Sequence 


5, Appli 


9 


39.5 


36 


.6 


450 


7 


us- 


11 


-074 


-176-212 


Sequence 


212, App 


10 


39 


36 


.1 


651 


7 


us- 


11 


-172 


-145-10 


Sequence 


10, Appl 


11 


39 


36 


.1 


1116 


6 


us- 


10 


-485 


-517-238 


Sequence 


238, App 


12 


39 


36 


.1 


1117 


6 


us- 


10 


-485 


-517-206 


Sequence 


206, App 


13 


39 


36 


.1 


1236 


7 


us- 


11 


-115 


-086-4 


Sequence 


4, Appli 


14 


38 


35 


2 


414 


6 


us- 


10 


-467 


-657-2432 


Sequence 


2432, Ap 


15 


38 


35 


2 


564 


7 


us- 


11 


-186 


-284-199 


Sequence 


199, App 


16 


37 


34 


3 


216 


6 


us- 


10 


-793 


-626-2624 


Sequence 


2624, Ap 


17 


37 


34 


.3 


307 


6 


us- 


10 


-793 


-626-684 


Sequence 


684, App 


18 


37 


34 


3 


457 


6 


us- 


10 


-763 


-712A-61 


Sequence 


61, Appl 


19 


37 


34 


3 


1065 


6 


us- 


10 


-793 


-626-1212 


Sequence 


1212, Ap 


20 


37 


34 


3 


1152 


7 


us- 


11 


-080 


-026-4 


Sequence 


4, Appli 


21 


36.5 


33 


.8 


126 


6 


us- 


10 


-821 


-234-1318 


Sequence 


1318, Ap 


22 


36 


33 


3 


265 


6 


us- 


10 


-793 


-626-2422 


Sequence 


2422, Ap 


23 


36 


33 


3 


272 


6 


us- 


10 


-793 


-626-2290 


Sequence 


2290, Ap 


24 


36 


33 


3 


286 


6 


us- 


10 


-793 


-626-1020 


Sequence 


1020, Ap 


25 


36 


33 


3 


356 


6 


us- 


10 


-793 


-626-2054 


Sequence 


2054, Ap 


26 


36 


33 


3 


388 


6 


us- 


10 


-467 


-657-786 


Sequence 


78 6, App 


27 


36 


33 


.3 


402 


6 


us- 


10 


-485 


-517-422 


Sequence 


422, App 


28 


36 


33 


3 


502 


6 


us- 


10 


-131 


-826A-548 


Sequence 


548, App 


29 


36 


33 


.3 


502 


6 


us- 


10 


-689 


-742-148 


Sequence 


148, App 


30 


36 


33 


.3 


505 


6 


us- 


10 


-467 


-657-5148 


Sequence 


5148, Ap 


31 


36 


33 


.3 


615 


7 


us- 


11 


-172 


-145-6 


Sequence 


6, Appli 


32 


36 


33 


.3 


617 


7 


us- 


11 


-172 


-145-8 


Sequence 


8, Appli 


33 


36 


33 


.3 


695 


6 


us- 


10 


-363 


-924-2 


Sequence 


2, Appli 


34 


36 


33 


.3 


892 


6 


us- 


10 


-507 


-275-3 


Sequence 


3, Appli 


35 


36 


33 


.3 


903 


7 


us- 


11 


-057 


-058-65 


Sequence 


65, Appl 


36 


36 


33 


.3 


904 


6 


us- 


10 


-507 


-275-5 


Sequence 


5, Appli 


37 


36 


33 


.3 


904 


7 


us- 


11 


-087 


-227-12 


Sequence 


12, Appl 


38 


36 


33 


3 


1442 


6 


us- 


10 


-793 


-626-2052 


Sequence 


2052, Ap 


39 


35.5 


32 


9 


635 


6 


us- 


10 


-821 


-234-927 


Sequence 


927, App 


40 


35.5 


32 


9 


2261 


6 


us- 


10 


-995 


-561-600 


• Sequence 


600, App 


41 


35 


32 


4 


154 


6 


us- 


10 


-467 


-657-158 


Sequence 


158, App 


42 


35 


32 


4 


154 


6 


us- 


10 


-467 


-657-6520 


Sequence 


6520, Ap 


43 


35 


32 


4 


154 


6 


us- 


10 


-467 


-657-7634 


Sequence 


7634, Ap 


44 


35 


32 


4 


182 


7 


us- 


11 


-000 


-463-717 


Sequence 


717, App 


45 


35 


32 


4 


219 


6 


us- 


10 


-467 


-657-4956 


Sequence 


4956, Ap 



ALIGNMENTS 



RESULT 1 

US-10-467-657-3490 

; Sequence 34 90, Application US/10467657 

; Publication No. US20050260581A1 

; GENERAL INFORMATION: 

; APPLICANT: CHIRON SpA 

; APPLICANT: FONTANA Maria Rita 

; APPLICANT: PIZZA Mariagrazia 

; APPLICANT: MASIGNANI Vega 

; APPLICANT: MONACI Elisabetta 

; TITLE OF INVENTION: GONOCOCCAL PROTEINS AND NUCLEIC ACIDS 
; FILE REFERENCE: 



; CURRENT APPLICATION NUMBER: US/lO/467 , 657 
; CURRENT FILING DATE: 2003-08-11 

PRIOR APPLICATION NUMBER: GB-0103424.8 
; PRIOR FILING DATE: 2001-02-12 
; NUMBER OF SEQ ID NOS : 9218 
; SOFTWARE: SeqWin99, version 1.04 
; SEQ ID NO 3490 

LENGTH: 98 

TYPE: PRT 
; ORGANISM: Neisseria gonorrhoeae 
US-10-467-657-3490 

Query Match 41.7%; Score 45; DB 6; Length 98; 

Best Local Similarity 53.8%; Pred. No. 1.4; 

Matches 7; Conservative 2; Mismatches 4; Indels 0; Gaps 0; 

Qy 7 EEDPYGHEDIYEE 19 

hill I :| I 
Db 75 EDDPYVHAGVYSE 87 



Search completed: December 17, 2005, 00:17:54 
Job time : 5.35714 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 



December 16, 2005, 23:33:32 ; Search time 20.7143 Seconds 

(without alignments) 
92.899 Million cell updates/sec 

US-10-789-494B-7 
108 

1 VETIVLEEDPYGHEDIYEED 20 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



283416 seqs, 96216763 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283416 



Database 



PIR_80:* 
1: pirl:* 
2: pir2:* 



3 : pir3 : * 
4: pir4:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 

Match Length 


DB 


ID 


Description 


1 


108 


100 


. 0 


2639 


2 


T31328 


fibroin - Chinese 


2 


50 


46 


. 3 


1100 


2 


AGIO 97 


alpha-xylosidase a 


3 


47 


43 


. 5 


221 


2 


A84638 


hypothetical prote 


4 


47 


43 


. 5 


636 


2 


F69027 


cleavage and polya 


5 


47 


43 


. 5 


937 


1 


S42366 


endopeptidase La h 


6 


47 


43 


. 5 


962 


1 


S57342 


endopeptidase La h 


7 


47 


43 


. 5 


1005 


2 


T12546 


hypothetical prote 


8 


47 


43 


.5 


1090 


2 


S59077 


cellulose 1 , 4-beta 


9 


46 


42 


. 6 


68 


2 


S27148 


gene 30.4 protein 


10 


46 


42 


. 6 


216 


2 


E89848 


conserved hypothet 


11 


45.5 


42 


. 1 


288 


2 


AH1399 


ABC transporter (A 


12 


45.5 


42 


. 1 


288 


2 


AG1775 


ABC transporter (A 


13 


45.5 


42 


. 1 


1165 


2 


A70423 


valine-tRNA ligase 


14 


45 


41 


. 7 


92 


2 


E81209 


conserved hypothet 


15 


45 


41 


. 7 


96 


2 


E81786 


conserved hypothet 


16 


45 


41 


. 7 


129 


2 


F72425 


2 -amino -4 -hydroxy - 


17 


45 


41 


. 7 


286 


1 


JQ2154 


frxC protein - Pie 


18 


45 


41 


. 7 


288 


2 


AF2440 


protochlorophyllid 


19 


45 


41 


. 7 


314 


2 


F84044 


malate dehydrogena 


20 


45 


41 


. 7 


382 


2 


B49193 


type II activin re 


21 


45 


41 


. 7 


429 


2 


AB3550 


NAD{P) transhydrog 


22 


45 


41 


. 7 


504 


2 


B40829 


activin receptor i 


23 


45 


41 


. 7 


512 


2 


D40829 


activin receptor i 


24 


45 


41 , 


. 7 


512 


2 


137134 


activin type II re 


25 


45 


41 . 


. 7 


513 


2 


JQ1484 


activin receptor p 


26 


45 


41 . 


. 7 


614 


2 


JH0500 


zinc finger protei 


27 


45 


41 , 


. 7 


726 


2 


D97012 


probable process iv 


28 


45 


4 i , 


. / 


O 1 ft ^ 

2206 


2 


G71611 


hypothetical prote 


29 


44 


40. 


.7 


300 


2 


A81418 


pseudouridylate sy 


30 


44 


40. 


.7 


357 


2 


S43278 


cell division cont 


31 


44 


40. 


.7 


376 


2 


A71175 


probable dehydroge 


32 


44 


40. 


.7 


407 


2 


F72772 


probable valine-py 


33 


44 


40. 


.7 


410 


2 


E75208 


probable valine-py 


34 


44 


40. 


,1 


539 


2 


F75497 


probable arginine 


35 


44 


40. 


.7 


695 


2 


T40451 


n-terminal acetylt 


36 


44 


40. 


.7 


850 


2 


F95260 


ABC transporter, p 


37 


44 


40. 


, 7 


850 


2 


A98126 


conserved hypothet 


38 


44 


40. 


,7 


973 


2 


T35238 


probable secreted 


39 


43.5 


40. 


,3 


526 


1 


G71081 


probable helicase 


40 


43.5 


40. 


3 


789 


2 


T38423 


hypothetical prote 


41 


43.5 


40. 


3 


1684 


2 


T02367 


hypothetical prote 


42 


43 


39. 


8 


89 


2 


T27015 


hypothetical prote 


43 


43 


39. 


8 


98 


2 


A70301 


ribosomal protein 


44 


43 


39. 


8 


114 


1 


E64423 


ychN protein homol 


45 


43 


39. 


8 


142 


2 


A97409 


hypothetical prote 



ALIGNMENTS 



RESULT 1 
T31328 

fibroin - Chinese oak silkmoth 

C; Species: Antheraea pernyi (Chinese oak silkmoth) 

C;Date: 22-Oct-1999 #sequence_revision 22-Oct-1999 #text_change 09-Jul-2004 
C; Access ion: T31328 

R;Sezutsu, H. ; Tamura, T.; Yukuhiro, K. 
submitted to the EMBL Data Library, August 1998 

A; Description: Characterization of the full length fibroin gene of a wild 
silkworm, Antheraea pernyi. 
A/Reference number: Z20995 
A; Access ion: T3132 8 

A/Status : preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A/Residues : 1-2639 <SEZ> 

A; Cross -references: UNI PROT: 076786; UNIPARC:UPI0000078D8E; EMBL: AF083334 ; 
NID:g3450882; PID: g3450883 ; PIDN : AAC32606 . 1 
C; Genetics : 
A;Introns: 14/3 

Query Match 100.0%; Score 108; DB 2; Length 2639; 

Best Local Similarity 100.0%; Pred. No. 1.5e-07; 

Matches 20; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 VETIVLEEDPYGHEDIYEED 20 

IIIIMIIIIIIIillllM 

Db 61 VETIVLEEDPYGHEDIYEED 80 



RESULT 2 
AG1097 

alpha-xylosidase and alpha-glucosidase homolog lmo0182 [imported] ~ Listeria 

monocytogenes (strain EGD-e) 

C; Species: Listeria monocytogenes 

C;Date: 27-NOV-2001 #sequence_revision 27-Nov-2001 #text_change 09-Jul-2004 
C;Accession: AG1097 

R;Glaser, P.; Frangeul, L. ; Buchrieser, C. ; Amend, A.; Baquero, F.; Berche, P.; 
Bloecker, H.; Brandt, P.; Chakraborty, T. ; Charbit, A.; Chetouani, F.; Couve, 
E.; de Daruvar, A.; Dehoux, P.; Domann, E. ; Dominguez-Bernal , G. ; Duchaud, E. ; 
Durand, L. ; Dussurget, 0.; Entian, K.D.; Fsihi, H. ; Garcia-Del Portillo, F.; 
Garrido, P.; Gautier, L.; Goebel, W.; Gomez-Lopez, N.; Hain, T.; Hauf, J.; 
Jackson, D. ; Jones, L.M.; Karst, U. 
Science 294, 849-852, 2001 

A;Authors: Kref t , J.; Kuhn, M. ; Kunst, F.; Kurapkat, G. ; Madueno, E. ; 
Maitournam, A.; Mata Vicente, J.; Ng, E.; Nordsiek, G.; Novella, S.; de Pablos, 
B.; Perez-Diaz, J.C.; Remmel, B.; Rose, M.; Rusniok, C. ; Schlueter, T. ; Simoes, 
N.; Tierrez, A.; Vazquez -Boland; J. A.; Voss, H. ; Wehland, J.; Cossart, P. 
A;Title: Comparative genomics of Listeria species. 
A;Reference number: AB1077; MUID: 21537279; PMID: 11679669 
A /Access ion: AG1097 
A; Status : prel iminary 
A;Molecule type: DNA 



A/Residues: 1-1100 <GLA> 

A /Cross-references: UNIPROT:Q8YAE8; UNIPARC:UPI0000055794; GB : NC_003210 ; 
PIDN:CAC98397.1; PID : gl64 0953 9 ; GSPDB : GN00177 
A; Experimental source: strain EGD-e 
C;Genetics : 
A;Gene: lmo0182 

Query Match 46.3%; Score 50; DB 2; Length 1100; 

Best Local Similarity 47.1%; Pred. No. 28; 

Matches 8; Conservative 4; Mismatches 5; Indels 0; Gaps 0; 

Qy 3 TIVLEEDPYGHEDIYEE 19 

I : Ihllh Ih 
Db 242 TAHISHDPFGHENRYEQ 258 



Search completed: December 16, 2005, 23:53:16 
Job time : 22.7143 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: December 16, 2005, 23:33:02 ; Search time 127.857 Seconds 

(without alignments) 
110.362 Million cell updates/sec 

Title: US-10-789-4 94B-7 

Perfect score: 108 

Sequence: 1 VETIVLEEDPYGHEDIYEED 2 0 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 2166443 seqs, 705528306 residues 

Total number of hits satisfying chosen parameters: 2166443 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : UniProt_05 . 80 : * 

1 : uniprot_sprot : * 
2 : uniprot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query- 
No. Score Match Length DB ID 



Description 



1 


108 


100 


.0 


507 


2 


Q8ISB3_9NEOP 


Q8isb3 


antheraea m 


2 


108 


100 


. 0 


2639 


2 


076786 ANTPE 


076786 


antheraea p 


3 


108 


100 


.0 


2655 


2 


Q964F4 ANTYA 


Q964f4 


antheraea y 


4 


66 


61 


.1 


1468 


2 


Q9GUB5_GALME 


Q9gub5 


galleria me 


5 


58 


53 


. 7 


509 


2 


Q7Z010_PLOIN 


Q7z010 


plodia inte 


6 


58 


53 


. 7 


735 


2 


Q7Z009_9NEOP 


Q7z009 


anagasta ku 


7 


56 


51 


. 9 


1065 


1 


DP03A_STAAR 


Q6gg04 


staphylococ 


8 


53 


49 


.1 


275 


2 


Q7NI14_GL0VI 


Q7nil4 


gloeobacter 


9 


50 


46 


.3 


358 


2 


Q8I2X6_PLAF7 


Q8i2x6 


Plasmodium 


10 


50 


46 


.3 


447 


2 


Q8PWT5 METMA 


Q8pwt5 


methanosarc 


11 


50 


46 


.3 


1100 


2 


Q724N1_LISMF 


Q724nl 


listeria mo 


12 


50 


46 


.3 


1100 


2 


Q8YAE8_LISMO 


Q8yae8 


listeria mo 


13 


49 


45 


.4 


321 


2 


Q4JZ47_STRPN 


Q4jz47 


streptococc 


14 


49 


45 


.4 


321 


2 


Q4K056_STRPN 


Q4k056 


streptococc 


15 


49 


45 


.4 


325 


2 


Q9XB05_MYXXA 


Q9xb05 


myxococcus 


16 


49 


45 


.4 


359 


2 


Q74ZB3 ASHGO 


Q74zb3 


ashbya goss 


17 


49 


45 


.4 


569 


2 


Q869X0_DICDI 


Q869x0 


dictyostel i 


18 


49 


45 


.4 


637 


2 


Q5CC74_9CUCU 


Q5CC74 


ot iorhynchu 


19 


49 


45 


.4 


721 


2 


Q609H0_METCA 


Q609h0 


methyl ococc 


20 


49 


45 


.4 


2482 


2 


Q7YYZ4_CRYPV 


Q7yyz4 


crypt ospori 


21 


49 


45 


.4 


4605 


2 


Q8IlR3_PIiAF7 


Q8ilr3 


Plasmodium 


22 


48 .5 


44, 


.9 


201 


2 


Q8TZA3 METKA 


Q8tza3 


methanopyru 


23 


48.5 


44, 


.9 


569 


2 


Q6BXH0_DEBHA 


Q6bxh0 


debaryomyce 


24 


48.5 


44. 


.9 


619 


2 


Q8IEN9_PLAF7 


Q8ien9 


Plasmodium 


25 


48 


44. 


.4 


209 


2 


Q60ZN9_CAEBR 


Q60zn9 


c a eno rha bd i 


26 


48 


44 , 


.4 


277 


2 


Q6UW19 HUMAN 


Q6uwl9 


homo sapien 


27 


48 


44 . 


,4 


277 


2 


Q9Y6Z7_HUMAN 


Q9y6z7 


homo sapien 


28 


48 


44, 


,4 


864 


2 


Q8IL99_PLAF7 


Q8il99 


Plasmodium 


29 


48 


44 . 


.4 
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ALIGNMENTS 



RESULT 1 
Q8ISB3_9NE0P 

ID Q8ISB3_9NE0P PRELIMINARY; 



PRT; 507 AA. 



AC Q8ISB3; 

DT Ol-MAR-2003 (TrEMBLrel . 23, Created) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT Ol-OCT-2 003 (TrEMBLrel. 25, Last annotation update) 

DE Fibroin (Fragment) . 

OS Antheraea mylitta. 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea ; 

OC Saturniidae; Saturniinae; Satumiini; Antheraea. 

OX NCBI_TaxID=34739; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RA Datta A., Ghosh A.K., Kundu S.C.; 

RT "Differential expression of the fibroin gene in developmental stages 

RT of silkworm, Antheraea mylitta (Saturniidae)."; 

RL Comp. Biochem. Physiol. B, Comp. Biochem. 129:197-204(2001). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RA Datta A., Ghosh A.K., Kundu S.C.; 

RL Submitted (JUL-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY136274; AAN28165.1; -; Genomic^DNA. 

FT NON_TER 507 5 07 

SQ SEQUENCE 507 AA; 45401 MW; C4FF5F3937268386 CRC64; 



Query Match 100.0%; Score 108; DB 2; Length 507; 

Best Local Similarity 100.0%; Pred. No. 4.2e-07; 

Matches 20; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 

Db 



1 VETIVLEEDPYGHEDIYEED 2 0 

IIIMIIIIIIIIIIIIIII 

60 VETIVLEEDPYGHEDIYEED 79 



Search completed: December 16, 2005, 23:52:14 
Job time : 129.857 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run. on: 



December 16, 2005, 23:08:17 ; Search time 81.25 Seconds 

(without alignments) 

70.301 Million cell updates/sec 



Title: US-10-78 9-494B-8 

Perfect score: 72 

Sequence: 1 DDGFVLDGGYDSE 13 

Scoring table: 



BLOSUM62 
Gapop 10.0 , Gapext 0.5 



Searched: 



2443163 seqs, 439378781 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



2443163 



Post -process ing : 



Database : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

A_Geneseq_21 : * 



geneseqpl98 0s : * 
geneseqpl990s : * 
geneseqp2000s : * 
geneseqp2001s : * 
geneseqp2002s : * 
geneseqp2003as : * 
geneseqp20 03bs : * 
geneseqp2004s : * 
geneseqp2005s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 

ADU51212 

ID ADU51212 standard; peptide; 13 AA. 
XX 

AC ADU51212; 
XX 

DT 24-FEB-2005 (first entry) 
XX 

DE Silkworm fibroin-derived fibroblast proliferation peptide 9. 
XX 

KW vulnerary; cell proliferation; wound healing; cell adhesion; cosmetics; 

KW cell culture; fibroin. 

XX 

OS Bombycoidea. 

OS Synthetic. 



XX 

PN JP2004339189-A. 
XX 

PD 02-DEC-2004. 
XX 

PF 04-DEC-2003; 2003JP-00406608 . 

XX 

PR 28'FEB-2003; 2003 JP-00055048 . 
XX 

PA (DOKU-) DOKURITSU GYOSEI HOJIN NOGYO SEIBUTSU SH, 

PA (TSUB/) TSUBOUCHI K. 

XX 

DR WPI; 2004-827614/82. 

XX 

PT New peptide having excellent cell growth promoting activity, for use as a 

PT cell growth promoter, cell adhesion agent, wound healing -promo ting agent, 

PT cosmetic and cell culture base material. 
XX 

PS Claim 2; Page; 27pp; Japanese. 
XX 

CC The invention relates to a novel peptide having excellent cell growth 

CC promoting activity. The peptide of the invention demonstrates vulnerary 

CC activity and may be utilised as a cell growth promoter, cell adhesion 

CC agent, wound healing -promo ting agent or cosmetic and cell culture base 

CC material. The current sequence is that of a silkworm fibroin-derived 

CC fibroblast proliferation peptide of the invention. 
XX 

SQ Sequence 13 AA; 

Query Match 100.0%; Score 72; DB 8; Length 13; 
Best Local Similarity 100.0%; Pred. No. 4.8e-05; 

Matches 13; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 DDGFVLDGGYDSE 13 

llillllllilll 

Db 1 DDGFVLDGGYDSE 13 



RESULT 5 

AAG19475 

ID AAG19475 Standard; protein; 138 AA. 
XX 

AC AAG19475; 

XX 

DT 17-OCT-2000 (first entry) 
XX 

DE Arabidopsis thaliana protein fragment SEQ ID NO: 21292. 
XX 

KW Protein identification; signal transduction pathway; metabolic pathway; 
KW hybridisation assay; genetic mapping; gene expression control; promoter; 
KW termination sequence. 
XX 

OS Arabidopsis thaliana. 
XX 

PN EP1033405-A2 . 
XX 
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99US- 


0158369P. 


PR 


13- 


-OCT- 


1999, 


99US- 


0159293P. 


PR 


13- 


-OCT- 


1999, 


99US- 


0159294P, 


PR 


13- 


-OCT- 


1999; 


99US- 


0159295P. 


PR 


14- 


-OCT- 


1999; 


99US- 


0159329P. 


PR 


14- 


-OCT- 


1999; 


9 9US- 


0159330P. 


PR 


14- 


-OCT- 


1999; 


99US- 


0159331P. 


PR 


14- 


-OCT- 


1999; 


99US- 


0159637P. 


PR 


14- 


-OCT- 


1999; 


99US- 


0159638P. 


PR 


18- 


-OCT- 


1999; 


99US- 


0159584P. 


PR 


21- 


-OCT- 


1999; 


99US- 


0160741P. 


PR 


21- 


-OCT- 


1999; 


99US- 


0160767P. 


PR 


21- 


-OCT- 


1999; 


99US- 


0160768P. 


PR 


21- 


-OCT- 


1999; 


99US- 


0160770P. 


PR 


21- 


-OCT- 


1999; 


99US- 


0160814P. 


PR 


21- 


•OCT- 


1999; 


99US- 


0160815P. 



PR 


22 


-OCT- 


1999 


99US- 


0160980P 


PR 


22 


-OCT- 


1999 


99US- 


0160981P 


PR 


22 


-OCT- 


1999 


99US- 


0160989P 


PR 


25 


-OCT- 


1999 


99US- 


0161404P 


PR 


25 


-OCT- 


1999 


99US- 


0161405P 


PR 


25 


-OCT- 


1999 


99US- 


0161406P 


PR 


26 


-OCT- 


1999 


99US- 


0161359P 


PR 


26 


-OCT- 


1999, 


99US- 


0161360P 


PR 


26 


-OCT- 


1999, 


99US- 


0161361P 


PR 


28 


-OCT- 


1999, 


99US- 


0161920P 


PR 


28 


-OCT- 


1999, 


99US- 


0161992P 


PR 


28 


-OCT- 


1999, 


99US- 


0161993P 


PR 


29 


-OCT- 


1999, 


99US- 


0162142P 



Query Match 62.5%; 
Best Local Similarity 63.6%; 
Matches 7; Conservative 



Score 45; DB 3; 
Pred. No. 18; 
3; Mismatches 



Length 138; 
1; Indels 



0 ; Gaps 



0; 



Qy 

Db 



1 DDGFVLDGGYD 11 

III ::|lhl 
4 0 DDGELIDGGFD 50 



Search completed: December 16, 2005, 23:46:13 
Job time : 84.25 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



December 16, 2005, 23:40:43 ; Search time 19.7321 Seconds 

(without alignments) 
54.469 Million cell updates/sec 

US-10-789-494B-8 
72 

1 DDGFVLDGGYDSE 13 
BLOSUM62 

Gapop 10,0 , Gapext 0.5 



572060 seqs, 82675679 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



572060 



Database 



Issued_Patents_AA: * 
1 : / cgn2_6 /p t oda t a / 1 / i aa / 5_C0MB . pep : * 



2 : / cgn2_6 /p t oda ta/l/iaa/ 6_C0MB . pep : * 

3 : /cgn2_6/ptodata/l/iaa/H_C0MB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/PCTUS__COMB.pep: * 

5 : /cgn2_6/ptodata/l/iaa/RE_COMB,pep: * 

6 : /cgn2_6/ptodata/l/iaa/backf ilesl .pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


40 


55 


.6 


115 


2 


US- 


09 


-991 


-181-227 


Sequence 


227, App 


2 


40 


55 


6 


115 


2 


US- 


09 


-990 


-444-227 


Sequence 


227, App 


3 


40 


55 


6 


115 


2 


us- 


09 


-997 


-333-227 


Sequence 


227, App 


4 


40 


55 


.6 


115 


2 


us- 


09 


-992 


-598-227 


Sequence 


227, App 


5 


40 


55 


.6 


243 


2 


US" 


09 


-134 


-OOOC-4106 


Sequence 


4106, Ap 


6 


40 


55 


.6 


459 


2 


us- 


09 


-303 


-518D-214 


Sequence 


214, App 


7 


40 


55 


6 


620 


2 


us- 


09 


-248 


-796A-20646 


Sequence 


20646, A 


8 


39 


54 


.2 


100 


1 


us- 


08 


-464 


-517-11 


Sequence 


11, Appl 


9 


39 


54 


.2 


100 


1 


us- 


08 


-246 


-361A-11 


Sequence 


11, Appl 


10 


39 


54 


2 


100 


2 


us- 


08 


-463 


-772-11 


Sequence 


11, Appl 


11 


39 


54 


.2 


100 


4 


PCT-US93- 


05000-11 


Sequence 


11, Appl 


12 


39 


54 


.2 


148 


1 


us- 


08 


-460 


-694-7 


Sequence 


7, Appli 


13 


39 


54 


2 


148 


2 


us- 


08 


-460 


-744-7 


Sequence 


1, Appli 


14 


39 


54 


2 


148 


2 


us- 


07 


-667 


-711B-7 


Sequence 


7, Appli 


15 


39 


54 


2 


159 


2 


us- 


09 


-270 


-767-32900 


Sequence 


32900, A 


16 


39 


54 


.2 


159 


2 


us- 


09 


-270 


-767-48117 


Sequence 


48117, A 


17 


39 


54 


.2 


318 


2 


us- 


09 


-120 


-051D-42 


Sequence 


42 , Appl 


18 


38 


52 


.8 


16 


2 


us- 


08 


-822 


-322-4 


Sequence 


4, Appli 


19 


38 


52 


.8 


16 


2 


us- 


09 


-466 


-109-4 


Sequence 


4, Appli 


20 


38 


52 


.8 


44 


2 


us- 


10 


-241 


-602B-11 


Sequence 


11, Appl 


21 


38 


52 


8 


. 126 


2 


us- 


09 


-471 


-276-1485 


Sequence 


1485, Ap 


22 


38 


52 


8 


173 


2 


us- 


09 


-311 


-021-132 


Sequence 


132, App 


23 


38 


52 


8 


174 


2 


us- 


09 


-149 


-476-424 


Sequence 


424, App 


24 


38 


52 


.8 


192 


2 


us- 


09 


-248 


-796A-17957 


Sequence 


17957, A 


25 


38 


52 


8 


251 


2 


us- 


08 


-822 


-322-9 


Sequence 


9, Appli 


26 


38 


52 


8 


251 


2 


us- 


09 


-466 


-109-9 


Sequence 


9, Appli 


27 


38 


52 


8 


252 


2 


us- 


08 


-822 


-322-8 


Sequence 


8, Appli 


28 


38 


52 


8 


252 


2 


us- 


09 


-466 


-109-8 


Sequence 


8, Appli 


29 


38 


52 


8 


290 


2 


us- 


09 


-248 


-796A-18749 


Sequence 


18749, A 


30 


38 


52 


8 


507 


2 


us- 


09 


-198 


-452A-174 


Sequence 


174, App 


31 


38 


52 


8 


536 


2 


us- 


09 


-252 


-991A-33121 


Sequence 


33121, A 


32 


38 


52 


8 


714 


2 


us- 


09 


-248 


-796A-18753 


Sequence 


18753, A 


33 


38 


52 


8 


816 


1 


us- 


08 


-820 


-170A-37 


Sequence 


37, Appl 


34 


38 


52 


8 


816 


2 


us- 


09 


-055 


-699-37 


Sequence 


37, Appl 


35 


38 


52 


8 


816 


2 


us- 


09 


-273 


-565-37 


Sequence 


37, Appl 


36 


38 


52 


8 


816 


2 


us- 


09 


-565 


-538-37 


Sequence 


37, Appl 


37 


38 


52 


8 


816 


2 


us- 


09 


-661 


-468-37 


Sequence 


37, Appl 


38 


38 


52 


8 


816 


2 


us- 


09 


-976 


-165-37 


Sequence 


37, Appl 


39 


38 


52. 


8 


821 


2 


us- 


09 


-438 


-185A-155 


Sequence 


155, App 


40 


38 


52 


8 


1098 


2 


us- 


09 


-602 


-787A-48 


Sequence 


48, Appl 


41 


38 


52 


8 


1874 


2 


us- 


09 


-602 


-787A-46 


Sequence 


46, Appl 


42 


38 


52. 


8 


5069 


2 


us- 


10 


-042 


-665A-5 


Sequence 


5, Appli 



43 
44 
45 



37 51.4 51 1 US-08-870-518-24 

37 51,4 115 1 US-08-542-363-39 
37 51.4 115 2 US-09-100-089-39 



Sequence 24, Appl 
Sequence 39, Appl 
Sequence 39, Appl 



ALIGNMENTS 



RESULT 1 

US-09-991-181-227 

Sequence 227, Application US/09991181 
Patent No. 6913919 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Ashkenazi , Avi J. 
Baker, Kevin P. 
Bot stein, David 
Desnoyers , Luc 
Eaton, Dan L. 
Ferrara , Napoleone 
Fong, Sherman 
Gerber , Hanspeter 
Gerritsen,Mary E. 
Goddard , Audrey 
Godowski , Paul J . 
Grimaldi , J . Christopher 
Gurney, Austin L. 
Kl javin, Ivar J. 
Napier, Mary A, 
Pan, James 
Paoni, Nicholas F. 
Roy, Margaret Ann 
Stewart, Timothy A. 
Tumas, Daniel 
Watanabe, Colin K. 
Williams, P. Mickey 
Wood, William I . 
Zhang, Zemin 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2730P1C53 

CURRENT APPLICATION NUMBER: US/09/991,181 
CURRENT FILING DATE: 2001-11-16 
PRIOR APPLICATION NUMBER: 60/049787 
PRIOR FILING DATE: 1997-06-16 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/065186 
PRIOR FILING DATE: 1997-11-12 
PRIOR APPLICATION NUMBER: 60/065311 
PRIOR FILING DATE: 1997-11-13 
PRIOR APPLICATION NUMBER: 60/066770 
PRIOR FILING DATE: 1997-11-24 
PRIOR APPLICATION NUMBER: 60/075945 
PRIOR FILING DATE: 1998-02-25 
PRIOR APPLICATION NUMBER: 60/078910 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/083322 



PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/084600 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/087106 
PRIOR FILING DATE: 1998-05-28 
PRIOR APPLICATION NUMBER: 60/087607 
PRIOR FILING DATE: 1998-06-02 
PRIOR APPLICATION NUMBER: 60/087609 
PRIOR FILING DATE: 1998-06-02 
PRIOR APPLICATION NUMBER: 60/087759 
PRIOR FILING DATE: 1998-06-02 
PRIOR APPLICATION NUMBER: 60/087827 
PRIOR FILING DATE: 1998-06-03 
PRIOR APPLICATION NUMBER: 60/088021 
PRIOR FILING DATE: 1998-06-04 
PRIOR APPLICATION NUMBER: 60/088025 
PRIOR FILING DATE: 1998-06-04 
PRIOR APPLICATION NUMBER: 60/088026 
PRIOR FILING DATE: 1998-06-04 
PRIOR APPLICATION NUMBER: 60/088028 
PRIOR FILING DATE: 1998-06-04 
PRIOR APPLICATION NUMBER: 60/088029 
PRIOR FILING DATE:. 1998-06-04 
PRIOR APPLICATION NUMBER: 60/088030 
PRIOR FILING DATE: 1998-06-04 
PRIOR APPLICATION NUMBER: 60/088 033 
PRIOR FILING DATE: 1998-06-04 
PRIOR APPLICATION NUMBER: 60/088326 
PRIOR FILING DATE: 1998-06-04 
PRIOR APPLICATION NUMBER: 60/088167 
PRIOR FILING DATE: 1998-06-05 
PRIOR APPLICATION NUMBER: 60/088202 
PRIOR FILING DATE: 1998-06-05 
PRIOR APPLICATION NUMBER: 60/088212 
PRIOR FILING DATE: 1998-06-05 
PRIOR APPLICATION NUMBER: 60/088217 
PRIOR FILING DATE: 1998-06-05 
PRIOR APPLICATION NUMBER: 60/088655 
PRIOR FILING DATE: 1998-06-09 
PRIOR APPLICATION NUMBER: 60/088734 
PRIOR FILING DATE: 1998-06-10 
PRIOR APPLICATION NUMBER: 60/088738 
PRIOR FILING DATE: 1998-06-10 
PRIOR APPLICATION NUMBER: 60/088742 
PRIOR FILING DATE: 1998-06-10 
PRIOR APPLICATION NUMBER: 60/088810 
PRIOR FILING DATE: 1998-06-10 
PRIOR APPLICATION NUMBER: 60/088824 
PRIOR FILING DATE: 1998-06-10 
PRIOR APPLICATION NUMBER: 60/088826 
PRIOR FILING DATE: 1998-06-10 
PRIOR APPLICATION NUMBER: 60/088858 
PRIOR FILING DATE: 1998-06-11 
PRIOR APPLICATION NUMBER: 60/088861 
PRIOR FILING DATE: 1998-06-11 
PRIOR APPLICATION NUMBER: 60/088876 
PRIOR FILING DATE: 1998-06-11 



PRIOR APPLICATION NUMBER: 60/089105 
PRIOR FILING DATE: 1998-06-12 
PRIOR APPLICATION NUMBER: 60/089440 
PRIOR FILING DATE: 1998-06-16 
PRIOR APPLICATION NUMBER: 60/089512 
PRIOR FILING DATE: 1998-06-16 
PRIOR APPLICATION NUMBER: 60/08 9514 
PRIOR FILING DATE: 1998-06-16 
PRIOR APPLICATION NUMBER: 60/089532 
PRIOR FILING DATE: 1998-06-17 
PRIOR APPLICATION NUMBER: 60/089538 
PRIOR FILING DATE: 1998-06-17 
PRIOR APPLICATION NUMBER: 60/089598 
PRIOR FILING DATE: 1998-06-17 
PRIOR APPLICATION NUMBER: 60/089599 
PRIOR FILING DATE: 1998-06-17 
PRIOR APPLICATION NUMBER: 60/089600 
PRIOR FILING DATE: 1998-06-17 
PRIOR APPLICATION NUMBER: 60/089653 
PRIOR FILING DATE: 1998-06-17 
PRIOR APPLICATION NUMBER: 60/0898 01 
PRIOR FILING DATE: 1998-06-18 
PRIOR APPLICATION NUMBER: 60/089907 
PRIOR FILING DATE: 1998-06-18 
PRIOR APPLICATION NUMBER: 60/089908 
PRIOR FILING DATE: 1998-06-18 
PRIOR APPLICATION NUMBER: 60/089947 
PRIOR FILING DATE: 1998-06-19 
PRIOR APPLICATION NUMBER: 60/089948 
PRIOR FILING DATE: 1998-06-19 
PRIOR APPLICATION NUMBER: 60/089952 
PRIOR FILING DATE: 1998-06-19 
PRIOR APPLICATION NUMBER: 60/090246 
PRIOR FILING DATE: 1998-06-22 
PRIOR APPLICATION NUMBER: 60/090252 
PRIOR FILING DATE: 1998-06-22 
PRIOR APPLICATION NUMBER: 60/090254 
PRIOR FILING DATE: 1998-06-22 
PRIOR APPLICATION NUMBER: 60/090349 
PRIOR FILING DATE: 1998-06-23 
PRIOR APPLICATION NUMBER: 60/090355 
PRIOR FILING DATE: 1998-06-23 
PRIOR APPLICATION NUMBER: 60/090429 
PRIOR FILING DATE:' 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090431 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090435 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090444 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090445 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090472 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090535 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090540 



; PRIOR FILING DATE: 1998-06-24 

; PRIOR APPLICATION NUMBER: 60/090542 

; PRIOR FILING DATE: 1998-06-24 

; PRIOR APPLICATION NUMBER: 60/090557 

; PRIOR FILING DATE: 1998-06-24 

; PRIOR APPLICATION NUMBER: 60/090676 

; PRIOR FILING DATE: 1998-06-25 

; PRIOR APPLICATION NUMBER: 60/090678 

PRIOR FILING DATE: 1998-06-25 

PRIOR APPLICATION NUMBER: 60/090690 

; PRIOR FILING DATE: 1998-06-25 

; PRIOR APPLICATION NUMBER: 60/090694 

; PRIOR FILING DATE: 1998-06-25 

; PRIOR APPLICATION NUMBER: 60/090695 

; PRIOR FILING DATE: 1998-06-25 

; PRIOR APPLICATION NUMBER: 60/090696 

; PRIOR FILING DATE: 1998-06-25 

PRIOR APPLICATION NUMBER: 60/090862 

; PRIOR FILING DATE: 1998-06-26 

; PRIOR APPLICATION NUMBER: 60/090863 

; PRIOR FILING DATE: 1998-06-26 

; PRIOR APPLICATION NUMBER: 60/091360 

; PRIOR FILING DATE: 1998-07-01 

; PRIOR APPLICATION NUMBER: 60/091478 

; PRIOR FILING DATE: 1998-07-02 

; PRIOR APPLICATION NUMBER: 60/091544 

; PRIOR FILING DATE: 1998-07-01 

; PRIOR APPLICATION NUMBER: 60/091519 

; PRIOR FILING DATE: 1998-07-02 

; PRIOR APPLICATION NUMBER: 60/091626 

; PRIOR FILING DATE: 1998-07-02 

; PRIOR APPLICATION NUMBER: 60/091633 

; PRIOR FILING DATE: 1998-07-02 

PRIOR APPLICATION NUMBER: 60/091978 

; PRIOR FILING DATE: 1998-07-07 

; PRIOR APPLICATION NUMBER: 60/091982 

; PRIOR FILING DATE: 1998-07-07 

; PRIOR APPLICATION NUMBER: 60/092182 

; PRIOR FILING DATE: 1998-07-09 

Query Match 55.6%; Score 40; DB 2; Length 115; 

Best Local Similarity 54.5%; Pred, No. 25; 

Matches 6; Conservative 4; Mismatches 1; Indels 

Qy 2 DGFVLDGGYDS 12 

:|| ::|lhl 
Db 81 NGFYINGGYES 91 



Search completed: December 17, 2005, 00:12:08 
Job time : 20.7321 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



December 16, 2005, 23:52:28 ; Search time 68.7143 Seconds 

( wi t hout a 1 ignmen t s ) 
79.049 Million cell updates/sec 



Title: US-10-789-494B-8 
Perfect score: 72 



Sequence : 



1 DDGFVLDGGYDSE 13 



1867569 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1867569 seqs, 417829326 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applications__AA_Main: * 

1 : /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep: * 

2 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 

3 : /cgn2_6/ptodata/l/pubpaa/US09_PUBCOMB.pep: * 

4 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep: * 

5: /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep:* 

6 : /cgn2_6/ptodata/l/pubpaa/USll_PUBC0MB .pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 

Match Length DB 


ID 








Description 


1 


72 


100 


.0 


13 


5 


US- 


10 


-789 


-494B-8 


Sequence 


8, Appli 


2 


72 


100 


.0 


13 


5 


US- 


■10 


-789 


-494B-68 


Sequence 


68, Appl 


3 


72 


100 


0 


28 


5 


US- 


10 


-789 


-494B-55 


Sequence 


55, Appl 


4 


46 


63 


9 


166 


4 


US" 


10 


-424 


-599-206658 


Sequence 


206658, 


5 


43 


59 


7 


173 


4 


US- 


10 


-424 


-599-265404 


Sequence 


265404, 


6 


43 


59 


7 


478 


4 


US- 


10 


-156 


-761-10964 


Sequence 


10964, A 


7 


42 


58 


3 


174 


4 


US- 


10 


-767 


-701-34455 


Sequence 


34455, A 


8 


42 


58 


3 


174 


4 


us- 


10 


-425 


-115-357755 


Sequence 


357755, 


9 


42 


58 


3 


174 


4 


us- 


10 


-425 


-115-357760 


Sequence 


357760, 


10 


42 


58 


3 


296 


4 


us- 


10 


-767 


-701-38933 


Sequence 


38933, A 


11 


42 


58 


3 


344 


5 


us- 


10 


-739 


-930-9631 


Sequence 


9631, Ap 


12 


42 


58 


3 


853 


4 


us- 


10 


-369 


-493-13852 


Sequence 


13852, A 


13 


42 


58 


3 


3649 


5 


us- 


10 


-732 


-923-20588 


Sequence 


20588, A 


14 


41 


56 


9 


86 


4 


us- 


10 


-156 


-761-9910 


Sequence 


9910, Ap 


15 . 


41 


56 


9 


91 


4 


us- 


10 


-424 


-599-188864 


Sequence 


188864, 


16 


41 


56 


9 


252 


4 


us- 


10 


-369 


-493-20496 


Sequence 


20496, A 



17 


41 


56 


. 9 


342 


4 


US- 


10 


-437 


-963-117472 


Sequence 


117472, 


18 


41 


56 


. 9 


360 


4 


us- 


10 


-461 


-194-151 


Sequence 


151, App 


19 


41 


56 


. 9 


819 


5 


us- 


10 


-503 


-135-116 


Sequence 


116, App 


20 


41 


56 


.9 


857 


4 


us- 


10 


-282 


-122A-67943 


Sequence 


67943, A 


21 


40.5 


56 


.2 


236 


4 


us- 


10 


-425 


-115-278601 


Sequence 


278601, 


22 


40. 5 


56 


2 


929 


4 


us- 


10 


-436 


-323-3 


Sequence 


3, Appli 


23 


40 


55 


.6 


53 


5 




10 


-450 


-763-54800 


Sequence 


54800, A 


24 


40 


55 


.6 


109 


4 


us- 


10 


-424 


-599-242651 


Sequence 


242651, 


25 


40 


55 


6 


111 


4 


us- 


10 


-767 


-701-38775 


Sequence 


38775, A 


26 


40 


55 


6 


115 


3 


us- 


09 


-989 


-722-227 


Sequence 


227, App 


27 


40 


55 


6 


115 


3 


us- 


09 


-989 


-723-227 


Sequence 


227, App 


28 


40 


55 


6 


115 


3 


us- 


09 


-989 


-279-227 


Sequence 


227, App 


29 


40 


55 


6 


115 


3 


us- 


09 


-989 


-727-227 


Sequence 


227, App 


30 


40 


55 


6 


115 


3 


us- 


09 


-989 


-731-227 


Sequence 


227, App 


31 


40 


55 


6 


115 


3 


us- 


09 


-989 


-732-227 


Sequence 


227, App 


32 


40 


55 


6 


115 


3 


us- 


09 


-991 


-073-227 


Sequence 


227, App 


33 


40 


55 


6 


115 


3 


us- 


09 


-990 


-442-227 


Sequence 


227, App 


34 


40 


55 


6 


115 


3 


us- 


09 


-991 


-163-227 


Sequence 


227, App 


35 


40 


55 


6 


115 


3 


us- 


09 


-993 


-604-227 


Sequence 


227, App 


36 


40 


55. 


6 


115 


3 


us- 


09 


-990 


-456-227 


Sequence 


227, App 


37 


40 


55 


6 


115 


3 


us- 


09 


-989 


-721-227 


Sequence 


227, App 


38 


40 


55 


6 


115 


3 


us- 


09 


-992 


-598-227 


Sequence 


227, App 


39 


40 


55 


6 


115 


3 


us- 


09 


-989 


-293A-227 


Sequence 


227, App 


40 


40 


55 


6 


115 


3 


us- 


09 


-989 


-735-227 


Sequence 


227, App 


41 


40 


55. 


6 


115 


3 


us- 


09 


-990 


-444-227 


Sequence 


227, App 


42 


40 


55. 


6 


115 


3 


us- 


09 


-991 


-181-227 


Sequence 


227, App 


43 


40 


55. 


6 


115 


3 


us- 


09 


-989 


-730-227 


Sequence 


227, App 


44 


40 


55. 


6 


115 


3 


us- 


09 


-990 


-436-227 


Sequence 


227, App 


45 


40 


55. 


6 


115 


3 




09 


-993 


-687-227 


Sequence 


227, App 



ALIGNMENTS 



RESULT 1 

US-10-789-494B-8 

; Sequence 8, Application US/10789494B 

; Publication No. US20050143296A1 

; GENERAL INFORMATION: 

; APPLICANT: TSUBOUCHI , Kozo 

; APPLICANT: YAMADA, Hiromi 

; TITLE OF INVENTION: EXTRACTION AND UTILIZATION OF CELL 

; TITLE OF INVENTION: GROWTH -PROMOTING PEPTIDES FROM SILK PROTEIN 

; FILE REFERENCE: OPS 635 

CURRENT APPLICATION NUMBER: US/lO/78 9 , 4 94B 
; CURRENT FILING DATE: 2004-02-27 
; PRIOR APPLICATION NUMBER: JP 2003-55048 
; PRIOR FILING DATE: 2003-02-28 
; NUMBER OF SEQ ID NOS : 85 
; SEQ ID NO 8 

LENGTH: 13 

TYPE : PRT 

ORGANISM: Antheraea yamamai 
US-10-789-494B-8 



Query Match 100.0%; Score 72; DB 5; Length 13; 

Best Local Similarity 100.0%; Pred. No. 4.4e-05; 



Matches 



13; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



1 DDGFVLDGGYDSE 13 



Db 




RESULT 4 

US- 10 -424 -599 -2 06658 

; Sequence 206658, Application US/10424599 

; Publication No. US20040031072A1 

; GENERAL INFORMATION: 

; APPLICANT: La Rosa Thomas J 

; APPLICANT: Kovalic David K 

; APPLICANT: Zhou Yihua 

; APPLICANT: Cao Yongwei 

TITLE OF INVENTION: Soy Nucleic Acid Molecules and Other Molecules Associated 
With 

; TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement 

; FILE REFERENCE: 38-21(53223)6 

; CURRENT APPLICATION NUMBER: US/l 0/424 , 599 

; CURRENT FILING DATE: 2003-04-28 

; NUMBER OF SEQ ID NOS : 285684 

; SEQ ID NO 206658 

LENGTH: 166 

TYPE: PRT 

ORGANISM: Glycine max 
FEATURE : 

OTHER INFORMATION: Clone ID: PAT_MRT3847_2863C. 1 .pep 
US- 10-424 -599-206658 

Query Match 63.9%; Score 46; DB 4; Length 166; 

Best Local Similarity 61.5%; Pred. No. 13; 

Matches 8; Conservative 3; Mismatches 2; Indels 0; Gaps 0; 
Qy 1 DDGFVLDGGYDSE 13 



Db 




Search completed: December 17, 2005, 00:17:34 
Job time : 69.7143 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein 



protein search, using sw model 



Run on: 



December 16, 2005, 23:53:24 



; Search time 3.48214 Seconds 
(without alignments) 
25.151 Million cell updates/sec 



Title: 

Perfect score: 



US-10-789-494B-8 
72 



Sequence : 



1 DDGFVLDGGYDSE 13 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 51470 seqs, 6736768 residues 

Total number of hits satisfying chosen parameters: 51470 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applications_AA_New: * 

1 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep: * 

2 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB.pep:* 

3 : /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB.pep: * 

4 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB.pep: * 

5 : /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep: * 

6 : /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB . pep : * 

7 : /cgn2_6/ptodata/l/pubpaa/USll_NEW_PUB .pep : * 

8 : /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 

Match Length 


DB 


ID 


Description 


1 


40 


55 


.6 


473 


6 


US-10-467-657-486 


Sequence 


486, App 


2 


38 


52 


. 8 


816 


7 


US-11-090-439-48 


Sequence 


48, Appl 


3 


37 


51 


.4 


95 


6 


US-10-467-657-8782 


Sequence 


8782, Ap 


4 


35 


48 


.6 


88 


7 


US-11-053-076-179 


Sequence 


179, App 


5 


35 


48 


.6 


94 


7 


US-11-053-076-184 


Sequence 


184, App 


6 


35 


48 


.6 


348 


6 


US-10-878-556A-111 


Sequence 


111, App 


7 


35 


48 


.6 


713 


6 


US-10-467-657-1012 


Sequence 


1012, Ap 


8 


35 


48 


.6 


713 


7 


US-11-190-799-2 


Sequence 


2, Appli 


9 


35 


48 


.6 


713 


7 


US-11-190-799-4 


Sequence 


4, Appli 


10 


35 


48 


6 


1267 


7 


US-'ll-109-156-35 


Sequence 


35, Appl 


11 


34 


47 


2 


192 


6 


US-10-467-657-6808 


Sequence 


6808, Ap 


12 


34 


47 


2 


376 


7 


US-11-055-822-152 


Sequence 


152, App 


13 


34 


47 


2 


507 


6 


US-10-467-657-1612 


Sequence 


1612, Ap 


14 


34 


47 


2 


1044 


7 


US-11-091-668-2 


Sequence 


2, Appli 


15 


34 


47 


2 


1313 


7 


US-11-091-668'4 


Sequence 


4, Appli 


16 


34 


47 


2 


7102 


7 


US-11-143-980-48 


Sequence 


48, Appl 


17 


34 


47 


2 


7968 


7 


US-11-143'980-49 


Sequence 


49, Appl 


18 


33 


45 


8 


124 


6 


US-10-467-657-3838 


Sequence 


3838, Ap 


19 


33 


45, 


8 


133 


6 


US-10-467-657-38 


Sequence 


38, Appl 


20 


33 


45. 


8 


133 


6 


US-10-467-657-8452 


Sequence 


8452, Ap 


21 


33 


45. 


8 


257 


6 


US-10-467-657-7456 


Sequence 


7456, Ap 


22 


33 


45. 


8 


282 


6 


US-10-878-556A-125 


Sequence 


125, App 



23 


33 


45 


8 


430 


6 


US- 


10 


-793 


-626-130 


Sequence 


130, App 


24 


33 


45 


8 


436 


7 


US- 


11 


-116 


-939-9 


Sequence 


9, Appli 


25 


33 


45 


8 


466 


6 


US- 


10 


-467 


-657-2360 


Sequence 


2360, Ap 


26 


33 


45 


.8 


471 


6 


us- 


10 


-995 


-561-901 


Sequence 


901, App 


27 


33 


45 


8 


678 


7 


us- 


11 


-102 


-240-34 


Sequence 


34, Appl 


28 


33 


45 


8 


782 


6 


us- 


10 


-821 


-234-1592 


Sequence 


1592, Ap 


29 


33 


45 


8 


824 


7 


us- 


11 


-116 


-939-11 


Sequence 


11, Appl 


30 


32.5 


45 


1 


423 


6 


us- 


10 


-467 


-962B-85 


Sequence 


85, Appl 


31 


32 


44 


4 


35 


6 


us- 


10 


-467 


-657-7098 


Sequence 


7098, Ap 


32 


32 


44 


4 


35 


6 


us- 


10 


-467 


-657-8030 


Sequence 


8030, Ap 


33 


32 


44 


4 


78 


7 


us- 


11 


-000 


-463-469 


Sequence 


469, App 


34 


32 


44 


4 


78 


7 


us- 


11 


-000 


-463-941 


Sequence 


941, App 


35 


32 


44 


4 


125 


6 


us- 


10 


-467 


-657-5290 


Sequence 


5290, Ap 


36 


32 


44 


4 


155 


6 


us- 


10 


-467 


-657-7322 


Sequence 


7322, Ap 


37 


32 


44 


4 


158 


6 


us- 


10 


-467 


-657-7326 


Sequence 


7326, Ap 


38 


32 


44 


4 


227 


6 


us- 


10 


-467 


-657-970 


Sequence 


970, App 


39 


32 


44 


4 


263 


6 


us- 


10 


-467 


-657-284 


Sequence 


284, App 


40 


32 


44 


4 


263 


6 


us- 


10 


-467 


-657-5298 


Sequence 


5298, Ap 


41 


32 


44 


4 


322 


6 


us- 


10 


-793 


-626-1912 


Sequence 


1912, Ap 


42 


32 


44 


4 


334 


7 


us- 


11 


-055 


-822-24 


Sequence 


24, Appl 


43 


32 


44 


4 


348 


7 


us- 


11 


-102 


-497-3 


Sequence 


3, Appli 


44 


32 


44 


4 


348 


7 


us- 


11 


-102 


-497-11 


Sequence 


11, Appl 


45 


32 


44 


4 


360 


6 


us- 


10 


-467 


-657-5376 


Sequence 


5376, Ap 



ALIGNMENTS 



RESULT 1 

US-10-467-657-486 

; Sequence 486, Application US/10467657 

; Publication No. US20050260581A1 

; GENERAL INFORMATION: 

; APPLICANT: CHIRON SpA 

; APPLICANT: FONTANA Maria Rita 

APPLICANT: PIZZA Mariagrazia 
; APPLICANT: MASIGNANI Vega 
; APPLICANT: MONACI Elisabetta 

; TITLE OF INVENTION: GONOCOCCAL PROTEINS AND NUCLEIC ACIDS 
; FILE REFERENCE: 

; CURRENT APPLICATION NUMBER: US/lO/467 , 657 

; CURRENT FILING DATE: 2003-08-11 

; PRIOR APPLICATION NUMBER: GB- 0103424. 8 

; PRIOR FILING DATE: 2 001-02-12 

; NUMBER OF SEQ ID NOS : 9218 

SOFTWARE: SeqWin99, version 1.04 
; SEQ ID NO 486 

LENGTH: 473 

TYPE: PRT 
; ORGANISM: Neisseria gonorrhoeae 
US-10-467-657-486 

Query Match 55.6%; Score 40; DB 6; Length 473; 

Best Local Similarity 77.8%; Pred. No. 9.9; 

Matches 7; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy 



1 DDGFVLDGG 9 



MM IM 

Db 417 DDGFCADGG 425 



Search completed: December 17, 2 005, 00:17:55 
Job time : 4.48214 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



December 16, 2005, 23:33:32 ; Search time 13.4643 Seconds 

(without alignments) 
92.899 Million cell updates/sec 

US-10-789-494B-8 
72 

1 DDGFVLDGGYDSE 13 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



283416 seqs, 96216763 residues 



Total number of hits satisfying chosen parameters: 



283416 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 4 5 summaries 



Database : PIR_80:* 

1: pirl:* 

2: pir2:* 

3 : pir3 : * 

4: pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result. being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


72 


100 


0 


2639 


2 


T31328 


fibroin - Chinese 


2 


45 


62 


5 


173 


2 


F84630 


hypothetical prote 


3 


44 


61 


1 


546 


2 


A69484 


hypothetical prote 


4 


44 


61 


1 


938 


2 


AF1772 


internal in- like pr 


5 


43 


59 


7 


657 


2 


B84791 


hypothetical prote 


6 


42 


58 


3 


223 


2 


D84233 


hypothetical prote 


7 


42 


58 


3 


249 


2 


G87105 


probable amidotran 



8 


42 


58 


.3 


867 


2 


G69485 


DNA-directed RNA p 


3 


41 


56 


. 9 


819 


2 


C71544 


leucine- tRNA ligas 


10 


41 


56 


. 9 


1196 


2 


A29130 


beta-amylase (EC 3 


11 


40 


55 


.6 


42 


2 


F81852 


hypothetical prote 


12 


40 


55 


.6 


520 


2 


151556 


recombination acti 


13 


40 


55 


.6 


706 


2 


S53035 


probable lysophosp 


14 


39 


54 


.2 


217 


2 


G85356 


hypothetical prote 


15 


39 


54 


.2 


329 


1 


G69210 


conserved hypothet 


16 


39 


54 . 


.2 


331 


1 


C69026 


acetylpolyamine am 


17 


39 


54 


.2 


381 


2 


AGOllO 


probable exported 


18 


39 


54 . 


.2 


482 


2 


A34948 


cycl in-related eel 


19 


39 


54 , 


.2 


482 


2 


S01153 


cell division cont 


20 


39 


54, 


.2 


499 


2 


AI1107 


internal in E [impo 


21 


39 


54 , 


.2 


528 


2 


S42510 


Rag-2 protein - ch 


22 


39 


54 , 


.2 


686 


2 


S43562 


K08E5.3 protein - 


23 


39 


54, 


.2 


813 


2 


G96494 


protein F7F22.4 [i 


24 


39 


54 . 


.2 


855 


2 


A53296 


DNA mismatch repai 


25 


39 


54 , 


.2 


889 


1 


S47162 


DNA-directed RNA p 


26 


39 


54, 


.2 


938 


2 


A56731 


chromatin assembly 


27 


38 


52, 


.8 


99 


2 


T09998 


12 -oxophytodienoat 


28 


38 


52, 


.8 


211 


2 


E87632 


conserved hypothet 


29 


38 


52 . 


.8 


229 


2 


AC3419 


transcription regu 


30 


38 


52, 


.8 


261 


2 


G87535 


regulatory protein 


31 


38 


52, 


.8 


284 


2 


AE3368 


probable thiosulfa 


32 


38 


52, 


.8 


292 


2 


E90180 


conserved hypothet 


33 


38 


52, 


.8 


337 


2 


AB3440 


acetylspermidine d 


34 


38 


52, 


.8 


360 


1 


S59311 


alcohol dehydrogen 


35 


38 


52 , 


.8 


407 


2 


T40582 


hypothetical prote 


36 


38 


52, 


.8 


428 


2 


G81362 


3 -phosphoshikimate 


37 


38 


52. 


.8 


428 


2 


JC5338 


3 -phosphoshikimate 


38 


38 


52, 


.8 


551 


2 


C89134 


protein F25G6.6 [i 


39 


38 


52. 


.8 


694 


2 


S10632 


replication initia 


40 


38 


52. 


,8 


820 


2 


A86510 


leucyl tRNA synthe 


41 


38 


52. 


.8 


820 


2 


C72113 


leucine- tRNA ligas 


42 


38 


52. 


.8 


821 


2 


AB1126 


internalin, peptid 


43 


38 


52 . 


.8 


970 


2 


E70533 


probable sulfatase 


44 


38 


52 . 


.8 


1070 


2 


AI0484 


probable autotrans 


45 


38 


52. 


.8 


5069 


2 


T17464 


rifamycin polyketi 



ALIGNMENTS 



RESULT 1 
T31328 

fibroin - Chinese oak silkmoth 

C; Species: Antheraea pernyi (Chinese oak silkmoth) 

C;Date: 22-Oct-1999 #sequence_revision 22-Oct-1999 #text_change 09-Jul-2004 
C;Accession: T31328 

R;Se2utsu, H.; Tamura, T. ; Yukuhiro, K. 
submitted to the EMBL Data Library, August 1998 

A; Description : Characterization of the full length fibroin gene of a wild 
silkworm, Antheraea pernyi . 
A; Reference number: Z20995 

A /Accession: T31328 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 



A/Residues: 1-2639 <SEZ> 

A; Cross -references: UNIPROT: 076786 ; UNIPARC:UPI0000078D8E; EMBL: AF083334 ; 
NID:g3450882; PID: g3450883 ; PIDN : AAC32606 . 1 
C; Genetics : 
A;Introns: 14/3 

Query Match 100.0%; Score 72; DB 2; Length 2639; 

Best Local Similarity 100.0%; Pred. No. 0.0019; 

Matches 13; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy • 1 DDGFVLDGGYDSE 13 

Illllllllllll 
Db 2579 DDGFVLDGGYDSE 2591 



RESULT 2 
F84630 

hypothetical protein At2g23940 [imported] - Arabidopsis thaliana 
C;Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Feb-2001 #sequence_revision 02-Feb-2001 #text_change 09-Jul-2004 
C;Accession: F84630 

R;Lin, X.; Kaul , S.; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, CD.; 
Fujii, C.Y.; Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V,; Buell, 
C.R.; Ketchum, K.A, ; Lee, J.J.; Ronning, CM.; Koo, H. ; Moffat, K.S.; Cronin, 
L.A.; Shen, M. ; VanAken, S.E.; Umayam, L. ; Tallon, L.J.; Gill, J.E.; Adams, 
M.D.; Carrera, A.J.; Creasy, T.H.; Goodman, H.M.; Somerville, C.R.; Copenhaver, 
G.P.; Preuss, D.; Nierman, W.C; White, 0.; Eisen, J. A.; Salzberg, S.L.; Fraser, 
CM. ; Venter, J.C 
Nature 402, 761-768, 1999 

A; Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis 
thaliana. 

A;Reference number: A84420; MUID:20083487; PMID: 10617197 

A; Access ion: F84 630 
A; Status : preliminary 
A; Molecule type: DNA 
A;Residues: 1-173 <STO> 

A; Cross -references : UNIPROT: 082222; UNIPARC:UPIO0000A5B51; GB:AE002093; 
NID:g3738323; PIDN : AAC63664 . 1 ; GSPDB : GN0013 9 
C; Genetics : 
A;Gene: At2g23940 
A;Map position: 2 

Query Match 62.5%; Score 45; DB 2; Length 173; 

Best Local Similarity 63.6%; Pred. No. 3; 

Matches 7; Conservative 3; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 DDGFVLDGGYD 11 

III ::|||:| 
Db 75 DDGELIDGGFD 85 



Search completed: December 16,. 2005, 23:53:17 
Job time : 14.4643 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2 005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 



December 16, 2005, 23:33:02 ; Search time 83.1071 Seconds 

(without alignments) 
110.362 Million cell updates/sec 

US-10-789-494B-8 

72 

1 DDGFVLDGGYDSE 13 



Scoring table: 



BL0SUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



2166443 seqs, 705528306 residues 



Total number of hits satisfying chosen parameters: 



2166443 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



UniProt_05.80:* 
1 : uniprot_sprot : * 
2 : uniprot_trembl : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



No. 


Score 


Match Length DB 


ID 


Description 


1 


72 


100 


.0 


151 


2 


Q95VQ0_ANTYA 


Q95vq0 


antheraea y 


2 


72 


100 


.0 


436 


2 


Q967T8_ANTPE 


Q967t8 


antheraea p 


3 


72 


100 


.0 


2639 


2 


076786_ANTPE 


076786 


antheraea p 


4 


72 


100 


.0 


2655 


2 


Q964F4_ANTyA 


Q964f4 


antheraea y 


5 


71 


98 


.6 


421 


2 


Q93119_ANTPE 


Q93119 


antheraea p 


6 


49 


68 


.1 


685 


2 


Q7VDT2_PROMA 


Q7vdt2 


prochloroco 


7 


47 


65, 


.3 


425 


2 


Q6D6K1 ERWCT 


Q6d6kl 


erwinia car 


8 


47 


65, 


.3 


428 


2 


Q6D355_ERWCT 


Q6d355 


erwinia car 


9 


46 


63- 


.9 


910 


2 


Q6B3 90_9PROT 


Q6b3 90 


uncultured 


10 


45 


62. 


.5 


173 


2 


Q8GYK4_ARATH 


Q8gyk4 


arabidopsis 


11 


45 


62 . 


.5 


173 


2 


Q8LEX3_ARATH 


Q81ex3 


arabidopsis 


12 


45 


62, 


.5 


173 


2 


082222 ARATH 


082222 


arabidopsis 


13 


44 


61. 


.1 


392 


2 


Q4LYV7_9BURK 


Q41yv7 


burkholderi 


14 


44 


61. 


.1 


458 


2 


Q5WHT0 BACSK 


Q5wht0 


bacillus cl 


15 


44 


61. 


, 1 


546 


2 


O28405_ARCFU 


028405 


archaeoglob 


16 


44 


61, 


, 1 


938 


2 


Q927R4 LISIN 


Q927r4 


listeria in 


17 


43 


59. 


.7 


56 


2 


Q5VQN1_0RYSA 


Q5vqnl 


oryza sativ 


18 


43 


59. 


7 


238 


2 


Q5TPW5 ANOGA 


Q5tpw5 


anopheles g 


19 


43 


59. 


7 


267 


2 


Q7QLB3_ANOGA 


Q7qlb3 


anopheles g 



20 


43 


59 


.7 


390 


2 


Q6T1G6_NE0FS 


Q6tlg6 


neoceratodu 


21 


43 


59 


.7 


478 


2 


Q82HT1_STRAW 


Q82htl 


streptomyce 


22 


43 


59 


.7 


490 


2 


Q90XH9_9SARC 


Q90xh9 


protopterus 


23 


43 


59 


.7 


601 


2 


Q7Q2X2_ANOGA 


Q7q2x2 


anopheles g 


24 


43 


59 


.7 


657 


2 


Q9ZUT5_ARATH 


Q9zut5 


arabidopsis 


25 


43 


59 


.7 


729 


2 


Q54RD7_DICDI 


Q54rd7 


dictyosteli 


26 


43 


59 


.7 


1228 


2 


Q4FYU1 LEIMA 


Q4fyul 


leishmania 


27 


42 


58 


.3 


223 


2 


Q9HRC6_HALSA 


Q9hrc6 


halobacteri 


28 


42 


58 


.3 


249 


2 


Q9CBU9 MYCLE 


Q9cbu9 


mycobacteri 


29 


42 


58 


.3 


364 


2 


Q5SPG9_BRARE 


Q5spg9 


brachydanio 


30 


42 


58 


.3 


390 


2 


Q6T1G5_LEPPA 


Q6tlg5 


lepidosiren 


31 


42 


58 


.3 


421 


2 


Q90XI2_9CHON 


Q90xi2 


triakis sp. 


32 


42 


58, 


.3 


459 


2 


Q5L207_GEOKA 


Q51207 


geobacillus 


33 


42 


58 , 


.3 


486 


2 


Q90XI3_CHIPU 


Q90xi3 


chiloscylli 


34 


42 


58, 


.3 


510 


2 


Q5PEW7 SALPA 


Q5pew7 


salmonella 


35- 


42 


58, 


.3 


511 


1 


GUAA_MYCPE 


Q8ews9 


mycoplasma 


36 


42 


58 . 


.3 


520 


2 


Q89YV0_BACTN 


Q89yv0 


bacteroides 


37 


42 


58, 


.3 


586 


2 


Q98J75_RHILO 


Q98j75 


rhizobium 1 


38 


42 


58. 


.3 


626 


2 


Q8I1Z9_PLAF7 


Q8ilz9 


Plasmodium 


39 


42 


58. 


.3 


647 


2 


Q8LDD0_ARATH 


Q81dd0 


arabidopsis 


40 


42 


58. 


.3 


647 


2 


Q9FM96_ARATH 


Q9fm96 


arabidopsis 


41 


42 


58 . 


.3 


823 


2 


Q723P8_LISMF 


Q723p8 


listeria mo 


42 


42 


58. 


.3 


866 


2 


Q7S7S5_NEUCR 


Q7s7s5 


neurospora 


43 


42 


58. 


3 


867 


2 


028391_ARCFU 


028391 


archaeoglob 


44 


42 


58 . 


,3 


881 


2 


Q9C261 NEUCR 


Q9C261 


neurospora 


45 


42 


58. 


,3 


1529 


2 


Q4SHR3_TETNG 


Q4shr3 


tetraodon n 
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DT 
DT 
DE 
GN 
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OC 
OX 
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RL 
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RA 
RL 
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FT 
SQ 



PRT; 



151 AA. 



RESULT 1 

Q95VQ0_ANTYA 

ID Q95VQ0_ANTYA PRELIMINARY; 
AC Q95VQ0; 

DT Ol-DEC-2001 {TrEMBLrel . 19, 
Ol-DEC-2001 (TrEMBLrel. 19, 
Ol-OCT-2003 (TrEMBLrel. 25, 
Fibroin heavy chain (Fragment) . 
Name=Fib-H; 

Antheraea yamamai (Japanese oak silkmoth) . 

Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 
Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea; 
Saturniidae; Saturniinae; Saturniini; Antheraea. 
NCBI TaxID=7121; 



Created) 

Last sequence update) 
Last annotation update) 



[1] 

NUCLEOTIDE SEQUENCE. 
Zurovec M . , Yang C . , 
Submitted (SEP-2001) 
[2] 

NUCLEOTIDE SEQUENCE. 
Fedic R. ; 

Submitted (AUG-2001) 
EMBL; AF410906; AAL02118.1; 
NON_TER 1 1 

SEQUENCE 151 AA; 13629 MW; 



Sehnal F. ; 

to the EMBL/GenBank/DDBJ databases. 



to the EMBL/GenBank/DDBJ databases. 
mRNA. 



596775A00040475A CRC64 ; 



Query Match 100.0%; Score 72; DB 2; Length 151; 

Best Local Similarity 100.0%; Pred. No. 0.00052; 
Matches 13; Conservative 0; Mismatches 0; Indels 

Qy 1 DDGFVLDGGYDSE 13 

lllllllllllll 
Db 91 DDGFVLDGGYDSE 103 
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